SEQ ID NO. 4201: 2603 V/R STRAIN

ATGGTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGGAATAAAGCTAACCTTTTC
ACTGGATGGGCTGACGTAGATCTTTCAGAAAAAGGTACACAACAAGCTATTGATGCTGGG
AAATTAATTCAAGCAGCAGGTATTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGT
GCCATCAAAACAACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAA
AAATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATAAAGCAGAA
GCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGTCGTTCATATGATGTATTG
CCTCCAGATATGGCTAAAGATGATGAACATTCAGCACATACTGATCGTCGCTATGCTTCA
CTAGATGATTCTGTTATTCCAGATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTT
CCTTTCTGGGAAGATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGT
GCACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAGATGATGAA
ATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTCGAATTTGATGAAAAATTA
AACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4202: 090 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4203: A909 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTAAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4204: H36B STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4205: 18RS21 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4206: M732 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG 

AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA

AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA 






WO 2004/018646 



PCT/US2003/026827 



Table 42: Comparative Sequences relating to SAG 0764 



TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4207: COH1 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4208: CJB110 STRAIN 

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4209: 1169NT STRAIN 

AGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4210: M781 STRAIN

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA

SEQ ID NO. 4211: JM9130013 STRAIN

GTAAAATTAGTATTCGCACGCCACGGTGAATCTGAGTGG
AATAAAGCTAACCTTTTCACTGGATGGGCTGACGTAGATCTTTCAGAAAA
AGGTACACAACAAGCTATTGATGCTGGGAAATTAATTCAAGCAGCAGGTA
TTGAGTTCGACCTTGCTTTTACATCAGTTCTTAAACGTGCCATCAAAACA
ACTAACCTTGCCCTTGAAGCAGCTGATCAACTTTGGGTACCAGTTGAAAA
ATCATGGCGCTTGAACGAACGTCATTACGGTGGATTGACAGGAAAAAATA
AAGCAGAAGCAGCTGAACAATTTGGTGATGAGCAAGTTCATATTTGGCGT
CGTTCATATGATGTATTGCCTCCAGATATGGCTAAAGATGATGAACATTC
AGCACATACTGATCGTCGCTATGCTTCACTAGATGATTCTGTTATTCCAG
ATGCAGAAAACCTAAAAGTTACTTTAGAGCGTGCTCTTCCTTTCTGGGAA
GATAAAATTGCTCCTGCTCTTAAAGATGGTAAAAATGTGTTTGTTGGTGC
ACACGGTAACTCAATCCGTGCTCTTGTAAAACATATCAAACAATTGTCAG
ATGATGAAATCATGGACGTTGAAATTCCTAACTTCCCACCACTTGTTTTC
GAATTTGATGAAAAATTAAACCTTGTTTCAGAATATTACTTAGGTAAA



PRETTY of: /biotmp/msa63264.2{*} March 10, 2003 09:30

                           1                                                   50
msa63264.2{110_090}        ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_1169NT}     ---------- -AGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_18RS21}     ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_2603}       atggtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_CJB110}     ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_COH1}       ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_H36B}       ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_JM9130013}  ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_M732}       ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_M781}       ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
msa63264.2{110_A909}       ---gtaaaat tAGTATTCGC ACGCCACGGT GAATCTGAGT GGAATAAAGC
Consensus                  ***        _********* ********** ********** **********

                           51                                                 100
msa63264.2{110_090}        TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_1169NT}     TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_18RS21}     TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_2603}       TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_CJB110}     TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_COH1}       TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_H36B}       TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_JM9130013}  TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_M732}       TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_M781}       TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
msa63264.2{110_A909}       TAACCTTTTC ACTGGATGGG CTGACGTAGA TCTTTCAGAA AAAGGTACAC
Consensus                  ********** ********** ********** ********** **********

                           101                                                150
msa63264.2{110_090}        AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_1169NT}     AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_18RS21}     AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_2603}       AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_CJB110}     AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_COH1}       AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_H36B}       AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_JM9130013}  AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_M732}       AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_M781}       AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
msa63264.2{110_A909}       AACAAGCTAT TGATGCTGGG AAATTAATTC AAGCAGCAGG TATTGAGTTC
Consensus                  ********** ********** ********** ********** **********

                           151                                                200
msa63264.2{110_090}        GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_1169NT}     GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_18RS21}     GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_2603}       GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_CJB110}     GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_COH1}       GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_H36B}       GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_JM9130013}  GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_M732}       GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_M781}       GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
msa63264.2{110_A909}       GACCTTGCTT TTACATCAGT TCTTAAACGT GCCATCAAAA CAACTAACCT
Consensus                  ********** ********** ********** ********** **********

                           201                                                250
msa63264.2{110_090}        TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_1169NT}     TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_18RS21}     TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_2603}       TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_CJB110}     TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_COH1}       TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_H36B}       TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_JM9130013}  TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_M732}       TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_M781}       TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
msa63264.2{110_A909}       TGCCCTTGAA GCAGCTGATC AACTTTGGGT ACCAGTTGAA AAATCATGGC
Consensus                  ********** ********** ********** ********** **********

                           251                                                300
msa63264.2{110_090}        GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_1169NT}     GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_18RS21}     GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_2603}       GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_CJB110}     GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_COH1}       GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_H36B}       GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_JM9130013}  GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_M732}       GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_M781}       GCTTgAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
msa63264.2{110_A909}       GCTTaAACGA ACGTCATTAC GGTGGATTGA CAGGAAAAAA TAAAGCAGAA
Consensus                  ****_***** ********** ********** ********** **********

                           301                                                350
msa63264.2{110_090}        GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_1169NT}     GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_18RS21}     GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_2603}       GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_CJB110}     GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_COH1}       GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_H36B}       GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_JM9130013}  GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_M732}       GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_M781}       GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
msa63264.2{110_A909}       GCAGCTGAAC AATTTGGTGA TGAGCAAGTT CATATTTGGC GTCGTTCATA
Consensus                  ********** ********** ********** ********** **********

                           351                                                400
msa63264.2{110_090}        TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_1169NT}     TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_18RS21}     TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_2603}       TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_CJB110}     TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_COH1}       TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_H36B}       TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_JM9130013}  TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_M732}       TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_M781}       TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
msa63264.2{110_A909}       TGATGTATTG CCTCCAGATA TGGCTAAAGA TGATGAACAT TCAGCACATA
Consensus                  ********** ********** ********** ********** **********

                           401                                                450
msa63264.2{110_090}        CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_1169NT}     CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_18RS21}     CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_2603}       CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_CJB110}     CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_COH1}       CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_H36B}       CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_JM9130013}  CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_M732}       CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_M781}       CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
msa63264.2{110_A909}       CTGATCGTCG CTATGCTTCA CTAGATGATT CTGTTATTCC AGATGCAGAA
Consensus                  ********** ********** ********** ********** **********

                           451                                                500
msa63264.2{110_090}        AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_1169NT}     AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_18RS21}     AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_2603}       AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_CJB110}     AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_COH1}       AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_H36B}       AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_JM9130013}  AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_M732}       AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_M781}       AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
msa63264.2{110_A909}       AACCTAAAAG TTACTTTAGA GCGTGCTCTT CCTTTCTGGG AAGATAAAAT
Consensus                  ********** ********** ********** ********** **********

                           501                                                550
msa63264.2{110_090}        TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_1169NT}     TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_18RS21}     TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_2603}       TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_CJB110}     TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_COH1}       TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_H36B}       TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_JM9130013}  TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_M732}       TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_M781}       TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
msa63264.2{110_A909}       TGCTCCTGCT CTTAAAGATG GTAAAAATGT GTTTGTTGGT GCACACGGTA
Consensus                  ********** ********** ********** ********** **********

                           551                                                600
msa63264.2{110_090}        ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_1169NT}     ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_18RS21}     ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_2603}       ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_CJB110}     ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_COH1}       ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_H36B}       ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_JM9130013}  ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_M732}       ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_M781}       ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
msa63264.2{110_A909}       ACTCAATCCG TGCTCTTGTA AAACATATCA AACAATTGTC AGATGATGAA
Consensus                  ********** ********** ********** ********** **********

                           601                                                650
msa63264.2{110_090}        ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_1169NT}     ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_18RS21}     ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_2603}       ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_CJB110}     ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_COH1}       ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_H36B}       ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_JM9130013}  ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_M732}       ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_M781}       ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
msa63264.2{110_A909}       ATCATGGACG TTGAAATTCC TAACTTCCCA CCACTTGTTT TCGAATTTGA
Consensus                  ********** ********** ********** ********** **********

                           651                                     690
msa63264.2{110_090}        TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_1169NT}     TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_18RS21}     TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_2603}       TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_CJB110}     TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_COH1}       TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_H36B}       TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_JM9130013}  TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_M732}       TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_M781}       TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
msa63264.2{110_A909}       TGAAAAATTA AACCTTGTTT CAGAATATTA CTTAGGTAAA
Consensus                  ********** ********** ********** **********



SEQ ID NO. 4212: 2603 V/R STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4213: 090 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4214: A909 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4215: H36B STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4216: 18RS21 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4217: M732 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK



SEQ ID NO. 4218: COH1 STRAIN

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4219: CJB110 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4220: 1169NT STRAIN 

VFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRAIKT
TNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLPPDM
AKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGAHGN
SIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4221: M781 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

SEQ ID NO. 4222: JM9130013 STRAIN 

VKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIQAAGIEFDLAFTSVLKRA
IKTTNLALEAADQLWVPVEKSWRLNERHYGGLTGKNKAEAAEQFGDEQVHIWRRSYDVLP
PDMAKDDEHSAHTDRRYASLDDSVIPDAENLKVTLERALPFWEDKIAPALKDGKNVFVGA
HGNSIRALVKHIKQLSDDEIMDVEIPNFPPLVFEFDEKLNLVSEYYLGK

PRETTY of: /biotmp/msa70722.2{*} March 10, 2003 09:33

                           1                                                   50
msa70722.2{110_090}        vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_18RS21}     vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_2603}       vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_A909}       vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_CJB110}     vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_COH1}       vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_H36B}       vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_JM9130013}  vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_M732}       vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_M781}       vklVFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
msa70722.2{110_1169NT}     ---VFARHGE SEWNKANLFT GWADVDLSEK GTQQAIDAGK LIQAAGIEFD
Consensus                     ******* ********** ********** ********** **********

                           51                                                 100
msa70722.2{110_090}        LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_18RS21}     LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_2603}       LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_A909}       LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_CJB110}     LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_COH1}       LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_H36B}       LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_JM9130013}  LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_M732}       LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_M781}       LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
msa70722.2{110_1169NT}     LAFTSVLKRA IKTTNLALEA ADQLWVPVEK SWRLNERHYG GLTGKNKAEA
Consensus                  ********** ********** ********** ********** **********

                           101                                                150
msa70722.2{110_090}        AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_18RS21}     AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_2603}       AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_A909}       AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_CJB110}     AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_COH1}       AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_H36B}       AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_JM9130013}  AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_M732}       AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_M781}       AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
msa70722.2{110_1169NT}     AEQFGDEQVH IWRRSYDVLP PDMAKDDEHS AHTDRRYASL DDSVIPDAEN
Consensus                  ********** ********** ********** ********** **********

                           151                                                200
msa70722.2{110_090}        LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_18RS21}     LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_2603}       LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_A909}       LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_CJB110}     LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_COH1}       LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_H36B}       LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_JM9130013}  LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_M732}       LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_M781}       LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
msa70722.2{110_1169NT}     LKVTLERALP FWEDKIAPAL KDGKNVFVGA HGNSIRALVK HIKQLSDDEI
Consensus                  ********** ********** ********** ********** **********

                           201                         229
msa70722.2{110_090}        MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_18RS21}     MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_2603}       MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_A909}       MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_CJB110}     MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_COH1}       MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_H36B}       MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_JM9130013}  MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_M732}       MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_M781}       MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
msa70722.2{110_1169NT}     MDVEIPNFPP LVFEFDEKLN LVSEYYLGK
Consensus                  ********** ********** *********
SEQ ID NO. 4301: 2603 V/R STRAIN

ATGAATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACT 

GTTGAAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCT 
AAT CAAAC CG AAATGGGACGTTT AG CTAAAAGTTAT ATTGATAAAG GTGAATT GGTTC CT 
GATGAAGTAAOU^CGGGATTGTAAAAGAGraCTTAGCTGAGGATGATATCGCAGAAAAA 
GGTTTTTTACTTGATGGATATCCACGTACTATTGAAC^^ 

CTTGAAGAACT AGGACTACG CTT AG ATGGTGTTATTAATATTAAAGTGGAT CCAT CATGT 
CTTATAGAGCGTTTGAGTGkTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAA 
GTGTT CAACC CAC CAGTAGATT ATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAG 
CCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTC^GGAGAACCTATTCTTGAA 
CACTAT CGTAAG CTT GGT CTTGT TACAGATATTGAAGGTAAT CAAGAAATAACAGAAGTT 
TTTGCAGATGTTGAAAAAGCGTTGCTAGAACTCAAA 

SEQ ID NO. 4302: 090 STRAIN (REVERSE COMPLEMENT)

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCA 

AGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCAGA.TCTCAACAGGGGATATGTTCCG' 
CG C CG CAATGG CTAAT CAAAC CGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGG 
TGAATTGGTT C CTGATGAAGT A^CAAACGGGATTGT AAAAGAG CG CTTAG CTGAGG ATGA 

CTTAGATG CTACG CTTGAAGA^CTAGGACTACG CTT AG ATGGTGT TATTAATATTAAAGT 
GGATCCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTG 
A^CTTTCCACAAAGTGTT CAAC CCAC CAGTAGATT AT AA^GAAGAAGATTACTAT CAACG 
TGAAGATGATAAG CCTGAAACTGTCAAACGT CX3 CTTGGACGTTAATATTG CTCAAGGAGA 
ACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGA^ 
AATAACAGAAGTTTTTGCAGATGTTGAAAAAGCGTTG 

SEQ ID NO. 4303: 1169NT STRAIN (REVERSE COMPLEMENT)

TGGTAAAGGGACTCAAGCAGCTAAGATTGTTGAAGAATTTGGTGTTGCGCA 

AGGGGATATGTTC CG CGCCGCAATGGCTAAT CAAACCGAAATGGGACGTTTAG CTAAAAG 

TTATATTGATAAAGGTGAATTGGTTCCTGATCAAGTAACAAACGGGA^ 

CTTAG CTGAGGATGATATCX3 CAG AAAA^GGTTTTTT ACTTGATGGGTATCCACGTAOT 

TGAACAAG CAC ACG C CTTAGATG CTACX3 CTTGAAG AACTAGGACT ACG CTTAG ATGGTGT 

TATTAATATTAAAGTGGATCCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATC^A 

TCGTAAAACTGGTGAAACTTTCCACAAAGTGTTCAA 

AGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCT 

TATTGCTCAAGGAGAACCTATTCTTGAAC^ 

TGAAGGTAATCAAGAAATAA 

SEQ ID NO. 4304: 18RS21 STRAIN (REVERSE COMPLEMENT) 

AAT CTTTT AAC CACGGGTTCG CCTGGTG CTGGTAAAGGTACTCAAG CAGCTAAGAT CG 
TTGAAGAATTTGGTGTTGCTCACATCTCIAACAGGG C CG CGCCG CAATGG CTA 

ATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTC 
ATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAG CTGAGGATGATATCG CAGAAAAAG 
GTTTTTT ACTTGATGG ATATCCACGTACTATTGAACAAGCACACG C CTTAGATG CTACG C 
TTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTC 
TTATAGAG CGTTTGAGTGGT CGTATTAT C^TCGT AAAACTGGTGAAACTTTCCACAAAG 
TGTTC^^CCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGC 
CTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCT CAAGGAGAAC CTATT CTTGAAC 
ACTATCGTAAGCITGGTCTTGTTACAGATATTGAAGGTA^ 
TTGCAGATGTTGAAAAAGCGTTG 

SEQ ID NO. 4305: A909 STRAIN (REVERSE COMPLEMENT)

AATCTTTTAATTATGGGTTTG C CTGGTG CTGGTAAAGGTACT CAAG CAG 
CTAAGATCGTTGAAG A^TTTGGTGTTGCTCACATCT CAACAGGGGATATGTT C CG CG C CG 
CAATGG CTAAT CAAAC CGAAATGGGACGTTTAG CTAAAAGTTATATTGATAAAGGTGAAT 
TGGTTC CT GATGAAGT AA.CAAACGGGATTGTAAAAGAG CG CTTAG CTGAGGAT GATATCG 
CAG AAAAAGGTTTTTTACTTGATGGATAT C CACGTACTATTGAACAAGCACACGC CTTAG 
ATG CTACG CTTGAAGAACTAGGACTACG CTT AGATGGTGTTATTAATATTAAAGTGGAT C 
CATCATGT CITATAG AGCGTTTG AGTGGTCGTATTATCAAT CGTAAAACTGGTGAAACTT 
TCCACAAAGTGTT CAAC C CACCAGTAGATT ATAAA.GAAGAAGATTACTATCAACGTGAAG 
ATG ATAAG CCTGAAACTGTCAA^CGT CG CTTGGACGTT AATATTG CTCAAGGAG 
TTCITGAACACTATCGAAAGCTTGGTCTTGTTAC^ 

SEQ ID NO. 4306: CJB110 STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAACCACGGGTTTGCrTGGTGCTGGTAAAGGTACTC^ 
GATCGTTGAAGAATTTGGTGTTGCTCACATCTCA^CAGGGGATATGTTCCGCG 
GGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTa^TAAAGGTGAATTGGT 
TCCTG ATGAAGTAACA&ACGGGATTGTAAAAGAG CG CTTAG CTG 

AAAAGGTTTTTTACTTGATGGATAT C CACGTACTATTGAACAAGCACACG C CTTAGATGC 
TACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 
ATGTCTTATAGAGCGTTTGAGTGGT CGTATTAT CAAT CGTAAAA.CTGGTG AAACTTT C CA 
CAAAGTGTT CAAC C CACCAGTAG ATTAT AAA.GAAG AAGATTACTATCAACGTGAAGATGA 
TAAGC CTGAAACTGT CAAACGTCGCTTGGACGTT AATATTG CT CAAGGAGAACCTATT CT 
TGAACACTATAG 

SEQ ID NO. 4307: COH1 STRAIN (REVERSE COMPLEMENT) 

AT CITTTAATT ATGGGTTTG CCTGGTG CTGGTAAAGGTACT CAAG CAG CT AAGATTGTTG 
AAG AATTTGGTGTTGCTCACATCTCAACAGGGG ATATGTT C CG CGC CG CAATGG CTAATC 
AAAC CCAAATGGGACGTTTAG CTAAAAGTTATATTGATAAAGGTGAATTGGTT CCTGATG 
AAGTAACAAACGGGATTGTAAAAGAG CG CTTAGCTGAGGATGATAT CGCAGAAAAAGGTT 
TTTTACTTGATGGATATCCACGTACTATTGAGCAAGC^ 

AAGAACTAGGACTACG CTTAG ATGGTGTTATTAAT ATT AAAGTGGATC CAACATG CCT^ 
TAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAAC^ 
T CAAC C CAC CAGTAG ATT AT AAAGAAGAAGATT ACTAT CAACGTG AAGATGATAAG CCTG 
AAACTGTCAAAC£nTCGCTTGGACGTTAATATTGCT 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAa\AATAACAGAAGTTTTTG 
CAGATGTTGAAAAAGCGTTG 



SEQ ID NO. 4308: H36B STRAIN (REVERSE COMPLEMENT)

CAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAA 
GTTATATTGATAAAGGTGAATTGGTTCCTGATGAAGTAACAAACGGGATTGTAAAAGAGC 
GCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATATCCACGTA 
TTGAACAAG CACACG C CITAGAT GCTACGCTTG AAG AACTAGGACTACG CTTAGATGGTG 
TTATTAAT ATTAAAGTGG AT C CATCATGTCTTATAGAG CGTTTGAGTG GT C GT ATTAT CA 
AT CGTAAAAC TGGTGAAACTTTC C^CAAAGT GT TCAAC CCAC CAGTAG ATTATAAAGAAG 
AAGATT ACTAT CAACGTGAAGATGATAAG C CTGAAACTGT CAAACGT CGCTTGGACGTTA 
ATATTG CT CAAGGAGAATCTATT CTTGAACACTAT CGTAAG CTTGGT CTTGTTACAGATA 
TTGAAG GTAAT C^GAAATAACAGAAGTTTTTG CAGATGTTGAAAAAG CGTTG 

SEQ ID NO. 4309: JM9130013 STRAIN (REVERSE COMPLEMENT)

AATCTTTTAATTATGGGTTTGCCTGGTG CTGGTAAAGGT 

ACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCrrCA.CAT CT CAACAGGGG ATATG 
TTCCGCGC CG CAATGG CTAAT CAAAC CGAAATGGGACGTTTAG CTAAAAGTTATATTGAT 
AAAGGTGAATTGGTTCCrGATGAAGTAAOVAACGGGATTGTAA 

G ATGATAT CG CAGAAAAAGGTTTTTT ACTTGAT GGATAT C CACGTACTATT GAACAAGCA 
CACGC CTT AGATG CTACG CTTGAAG AACTAGGACTACGCIT'AGATGGTGTTATTAATATT 
AAAGTGGATC CATCATGTCTTAT AGAGCGTTTGAGTGGT CGTATTATCAAT CGTAAAACT 
GGTGAAACTTT C CACAAAGTGTT CAACC CAC CAGTAGATTATAAAGAAGAAGATT ACTAT 
CAACGTGAAGATGATAAG CCTGAAACTGTTAAACGTCGCTTGGACGTTAATAT^ 
GGAGAACCTATT Cl'1'GAACACTATAAAAAG CTTGGTCTTGTTACAGATATTGAAGGTAAT 
CA 

SEQ ID NO. 4310: M732 STRAIN (REVERSE COMPLEMENT) 

CTTTTAATTATGGGTTTG CCTGGTG CTGGTAAAGGTACTCAAG CAGCTAAGATTGTTGAA 

G AATTTGGTGTTGCT CACATCTCAACAGGGGATATGTT^ CAA 

AC C CAAATGGGACGTTTAGCT AAAAGTTATATTGATAAAGGTGAATTGGTT CCT 

GTAACAAACGGGATTGTAAAAGAGCG CTTAG CTGAGGATGATATCG CAGAAAAAGGTTTT 

TTACTTGATGGATAT CCACGTACTATTGAG CAAGCACACGCCTTAGATGCTACG CTTGAA 

GAACTAGGAOTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCM 

GAGCXSTTTGAGTGGCCGTATTATCAATCGTAAA^CTGGTC 

AACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAA 

ACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCT 

CGTAAGCTTGGTCTTGTTAC^GATATTGAAGGTAATCAAGAA^ 

G ATGTTGAAAAAG CGTTG 



SEQ ID NO. 4311: M781 STRAIN (REVERSE COMPLEMENT) 

AAT CITTTAATTACGGGTTTG CCIGGTGCTGGTAAAGG 

G CAGCTAAGATTGTTG AAGAATTTGGTGTTG CT CACAT CTCAACAGGGGATATGTT CCG C 

GCCGCAATGGCTAATCAAACCCAAATGGGACGTTTAGCTAAAAGTTA 

GAATTGGTTCCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTT 

AT CGCAGAAAAAGGTTTTTT ACTTGATGGATATCCACX3TACT CAAGCACACG CC 

TTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATC 

GAT CCAACATG CCTTATAGAG CGTTTGAGTGGCCGT ATTAT CAATCGTAAAACTGGTGAA 
ACTTT C CACAAAGTGTTCAACC CACCAGTAG ATTAT AAAGAAGAAGATTACTATCAACGT 
GAAGATGATAAGCCTGAAACTGTCAAACGTCGCITGJGAOT 
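
Several of the entries above are labelled as reverse complements. As an illustration only, the following minimal Python sketch shows how a reverse complement can be computed; the function name `reverse_complement` is hypothetical, the input is a short example string, and ambiguity codes or damaged characters are simply passed through unchanged (an assumption, not something the listing specifies).

```python
# Hypothetical helper for illustration; not part of the original listing.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A<->T, C<->G).

    Whitespace is removed first; characters other than A, C, G, T
    (in either case) are left as they are.
    """
    cleaned = "".join(seq.split())  # drop spaces and line breaks
    return cleaned.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGAATCTTT"))
```

Applying the function twice returns the cleaned input, which is a quick sanity check on any transcription.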



MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa25038.2{*} April 17, 2002 08:53 
PRETTY of: /biotmp/msa252229.2{*} January 31, 2003 03:05 .. 

                            1                                                   50
msa252229.2{114_COH1}           atcttt taattatggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_M732}             Cttt taattatggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_M781}          Aatcttt taattacggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_A909}          Aatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_JM9130013}     Aatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_CJB110}        Aatcttt taaccacggg tttgcttggt gctggtaaag gtactcaagc
msa252229.2{114_090}           Aatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_2603}       atgAatcttt taattatggg tttgcctggt gctggtaaag gtactcaagc
msa252229.2{114_H36B}
msa252229.2{114_18RS21}        Aatcttt taaccacggg ttcgcctggt gctggtaaag gtactcaagc
msa252229.2{114_1169NT}                                      --tggtaaag ggactcaagc

Consensus **** 



                            51                                                 100
msa252229.2{114_COH1}       agctaagatt gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_M732}       agctaagatt gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_M781}       agctaagatt gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_A909}       agctaagatc gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_JM9130013}  agctaagatc gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_CJB110}     agctaagatc gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_090}        agctaagatc gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_2603}       agctaagatc gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_H36B}                                                   -CAGGGGATA
msa252229.2{114_18RS21}     agctaagatc gttgaagaat ttggtgttgc tcacatctca aCAGGGGATA
msa252229.2{114_1169NT}     agctaagatt gttgaagaat ttggtgttgc gcacatctca aCAGGGGATA
Consensus                                                               _*********



                            101                                                150
msa252229.2{114_COH1}       TGTTCCGCGC CGCAATGGCT AATCAAACCc AAATGGGACG TTTAGCTAAA
msa252229.2{114_M732}       TGTTCCGCGC CGCAATGGCT AATCAAACCc AAATGGGACG TTTAGCTAAA
msa252229.2{114_M781}       TGTTCCGCGC CGCAATGGCT AATCAAACCc AAATGGGACG TTTAGCTAAA
msa252229.2{114_A909}       TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_JM9130013}  TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_CJB110}     TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_090}        TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_2603}       TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_H36B}       TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_18RS21}     TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
msa252229.2{114_1169NT}     TGTTCCGCGC CGCAATGGCT AATCAAACCg AAATGGGACG TTTAGCTAAA
Consensus                   ********** ********** *********_ ********** **********



                            151                                                200
msa252229.2{114_COH1}       AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_M732}       AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_M781}       AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_A909}       AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_JM9130013}  AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_CJB110}     AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_090}        AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_2603}       AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_H36B}       AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_18RS21}     AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATgAAGTAA CAAACGGGAT
msa252229.2{114_1169NT}     AGTTATATTG ATAAAGGTGA ATTGGTTCCT GATcAAGTAA CAAACGGGAT
Consensus                   ********** ********** ********** ***_****** **********



                            201                                                250
msa252229.2{114_COH1}       TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_M732}       TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_M781}       TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_A909}       TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_JM9130013}  TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_CJB110}     TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_090}        TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_2603}       TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_H36B}       TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_18RS21}     TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
msa252229.2{114_1169NT}     TGTAAAAGAG CGCTTAGCTG AGGATGATAT CGCAGAAAAA GGTTTTTTAC
Consensus                   ********** ********** ********** ********** **********



                            251                                                300
msa252229.2{114_COH1}       TTGATGGaTA TCCACGTACT ATTGAgCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_M732}       TTGATGGaTA TCCACGTACT ATTGAgCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_M781}       TTGATGGaTA TCCACGTACT ATTGAgCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_A909}       TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_JM9130013}  TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_CJB110}     TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_090}        TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_2603}       TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_H36B}       TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_18RS21}     TTGATGGaTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
msa252229.2{114_1169NT}     TTGATGGgTA TCCACGTACT ATTGAaCAAG CACACGCCTT AGATGCTACG
Consensus                   *******_** ********** *****_**** ********** **********



                            301                                                350
msa252229.2{114_COH1}       CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_M732}       CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_M781}       CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_A909}       CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_JM9130013}  CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_CJB110}     CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_090}        CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_2603}       CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_H36B}       CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_18RS21}     CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
msa252229.2{114_1169NT}     CTTGAAGAAC TAGGACTACG CTTAGATGGT GTTATTAATA TTAAAGTGGA
Consensus                   ********** ********** ********** ********** **********



                            351                                                400
msa252229.2{114_COH1}       TCCAaCATGc CTTATAGAGC GTTTGAGTGg cCGTATTATC AATCGTAAAA
msa252229.2{114_M732}       TCCAaCATGc CTTATAGAGC GTTTGAGTGg cCGTATTATC AATCGTAAAA
msa252229.2{114_M781}       TCCAaCATGc CTTATAGAGC GTTTGAGTGg cCGTATTATC AATCGTAAAA
msa252229.2{114_A909}       TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA
msa252229.2{114_JM9130013}  TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA
msa252229.2{114_CJB110}     TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA
msa252229.2{114_090}        TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA
msa252229.2{114_2603}       TCCAtCATGt CTTATAGAGC GTTTGAGTGk tCGTATTATC AATCGTAAAA
msa252229.2{114_H36B}       TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA
msa252229.2{114_18RS21}     TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA






WO 2004/018646 



PCT/US2003/026827 



Table 43: Comparative Sequences relating to SAG0079 



msa252229.2{114_1169NT}     TCCAtCATGt CTTATAGAGC GTTTGAGTGg tCGTATTATC AATCGTAAAA
Consensus                   ****-****_ ********** *********_ _********* **********



                            401                                                450
msa252229.2{114_COH1}       CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_M732}       CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_M781}       CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_A909}       CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_JM9130013}  CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_CJB110}     CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_090}        CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_2603}       CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_H36B}       CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_18RS21}     CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
msa252229.2{114_1169NT}     CTGGTGAAAC TTTCCACAAA GTGTTCAACC CACCAGTAGA TTATAAAGAA
Consensus                   ********** ********** ********** ********** **********



                            451                                                500
msa252229.2{114_COH1}       GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_M732}       GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_M781}       GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_A909}       GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_JM9130013}  GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TtAAACGTCG
msa252229.2{114_CJB110}     GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_090}        GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_2603}       GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_H36B}       GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_18RS21}     GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
msa252229.2{114_1169NT}     GAAGATTACT ATCAACGTGA AGATGATAAG CCTGAAACTG TcAAACGTCG
Consensus                   ********** ********** ********** ********** *_********



                            501                                                550
msa252229.2{114_COH1}       CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactatcgta
msa252229.2{114_M732}       CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactatcgta
msa252229.2{114_M781}       CTTGGACGTT aATATTGCTC AA
msa252229.2{114_A909}       CTTGGACGTT aATATTGCTC AAggagaatc tattcttgaa cactatcgaa
msa252229.2{114_JM9130013}  CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactataaaa
msa252229.2{114_CJB110}     CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactatag--
msa252229.2{114_090}        CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactatcgta
msa252229.2{114_2603}       CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactatcgta
msa252229.2{114_H36B}       CTTGGACGTT aATATTGCTC AAggagaatc tattcttgaa cactatcgta
msa252229.2{114_18RS21}     CTTGGACGTT aATATTGCTC AAggagaacc tattcttgaa cactatcgta
msa252229.2{114_1169NT}     CTTGGACGTT cATATTGCTC AAggagaacc tattcttgaa cactatagta
Consensus                   ********** _********* **



                            551                                                600
msa252229.2{114_COH1}       agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt
msa252229.2{114_M732}       agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt
msa252229.2{114_M781}
msa252229.2{114_A909}       agcttggtct tgttacagat attgaaggta a
msa252229.2{114_JM9130013}  agcttggtct tgttacagat attgaaggta atca
msa252229.2{114_CJB110}
msa252229.2{114_090}        agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt
msa252229.2{114_2603}       agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt
msa252229.2{114_H36B}       agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt
msa252229.2{114_18RS21}     agcttggtct tgttacagat attgaaggta atcaagaaat aacagaagtt
msa252229.2{114_1169NT}     agcttggcct tgttacagat attgaaggta atcaagaaat aa
Consensus



                            601                                 636
msa252229.2{114_COH1}       tttgcagatg ttgaaaaagc gttg
msa252229.2{114_M732}       tttgcagatg ttgaaaaagc gttg
msa252229.2{114_M781}
msa252229.2{114_A909}
msa252229.2{114_JM9130013}
msa252229.2{114_CJB110}
msa252229.2{114_090}        tttgcagatg ttgaaaaagc gttg
msa252229.2{114_2603}       tttgcagatg ttgaaaaagc gttgctagaa ctcaaa
msa252229.2{114_H36B}       tttgcagatg ttgaaaaagc gttg
msa252229.2{114_18RS21}     tttgcagatg ttgaaaaagc gttg
msa252229.2{114_1169NT}
Consensus                   ****** ******
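
The Consensus rows in the alignment above appear to carry `*` under columns where every aligned row shows the same residue and a different character (`_` or `-` in this copy) where the rows differ. The Python sketch below reproduces that marking under exactly that assumption; the function name `consensus_marks` is hypothetical, the comparison ignores letter case, and gap characters are treated like any other symbol. It is not a description of how the PRETTY program itself computes its consensus line.

```python
def consensus_marks(rows, match="*", mismatch="_"):
    """Mark each alignment column: `match` if all rows agree, else `mismatch`.

    Assumption: agreement is judged case-insensitively, since the listing
    mixes upper- and lower-case letters for the same residue.
    """
    if not rows:
        return ""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")
    marks = []
    for column in zip(*rows):
        residues = {ch.upper() for ch in column}
        marks.append(match if len(residues) == 1 else mismatch)
    return "".join(marks)


if __name__ == "__main__":
    # Positions 181-190 of the nucleotide alignment: ten rows read
    # GATgAAGTAA and the 1169NT row reads GATcAAGTAA.
    block = ["GATgAAGTAA"] * 10 + ["GATcAAGTAA"]
    print(consensus_marks(block))  # prints ***_******
```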



SEQ ID NO. 4312: 2603 V/R STRAIN 

MNLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVP
DEVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSC
LIERLSXRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILE
HYRKLGLVTDIEGNQEITEVFADVEKALLELK



SEQ ID NO. 4313: 090 STRAIN 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YRKLGLVTDIEGNQEITEVFADVEKALLELK

SEQ ID NO. 4314: 1169NT STRAIN

GKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPDQVTNGIVKER 
LAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCLIERLSGRIIN
RKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVHIAQGEPILEHYSKLGLVTDI
EGNQEI

SEQ ID NO. 4315: 18RS21 STRAIN 

NLLTTGSPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YRKLGLVTDIEGNQEITEVFADVEKALLE

SEQ ID NO. 4316: A909 STRAIN 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
YRKLGLVTDIEG

SEQ ID NO. 4317: A909 STRAIN 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH
YRKLGLVTDIEG

SEQ ID NO. 4318: CJB110 STRAIN 

NLLTTGLLGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH 
Y 

SEQ ID NO. 4319: COH1 STRAIN 

LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPDE
VTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCLI
ERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEHY
RKLGLVTDIEGNQEITEVFADVEKALL

SEQ ID NO. 4320: H36B STRAIN 

GDMFRAAMANQTEMGRLAKSYIDKGELVPDEVTNGIVKERLAEDDIAEKGFLLDGYPRTI
EQAHALDATLEELGLRLDGVINIKVDPSCLIERLSGRIINRKTGETFHKVFNPPVDYKEE
DYYQREDDKPETVKRRLDVNIAQGESILEHYRKLGLVTDIEGNQEITEVFADVEKAL

SEQ ID NO. 4321: JM9130013 STRAIN 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH
YKKLGLVTDIEGN

SEQ ID NO. 4322: M732 STRAIN

LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPDE
VTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCLI
ERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEHY
RKLGLVTDIEGNQEITEVFADVEKALLELK

SEQ ID NO. 4323: M781 STRAIN 

NLLITGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPD
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL
IERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQ
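
The amino acid sequences above read as translations of the corresponding nucleotide entries. As a rough illustration, the Python sketch below translates a nucleotide string with the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical, the reading frame is assumed to start at the first base, and any codon containing a character other than A, C, G or T is reported as `X` (an assumption made for this sketch). The sample input is the start of the 2603 row of the nucleotide alignment.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str, unknown: str = "X") -> str:
    """Translate DNA in frame 1; incomplete trailing codons are ignored."""
    seq = "".join(dna.split()).upper()  # drop spacing, normalise case
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], unknown))
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the 2603 row in the nucleotide alignment.
    print(translate("atgAatcttt taattatggg t"))  # prints MNLLIMG
```

The printed peptide agrees with the first seven residues given for SEQ ID NO. 4312.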



MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa32357.2{*} April 17, 2002 09:17 



                            1                                                   50
msa252352.2{114_18RS21}     -nllttgspg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_M781}       -nllitglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTqMGRLAK
msa252352.2{114_CJB110}     -nllttgllg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_090}        -nllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_JM9130013}  -nllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_A909}       -nllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_1169NT}                -gkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_2603}       mnllimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTeMGRLAK
msa252352.2{114_COH1}       --llimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTqMGRLAK
msa252352.2{114_M732}       --llimglpg agkgtqaaki veefgvahis tGDMFRAAMA NQTqMGRLAK
msa252352.2{114_H36B}                                        -GDMFRAAMA NQTeMGRLAK
Consensus                                                  * _********* ***_******



                            51                                                 100
msa252352.2{114_18RS21}     SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_M781}       SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_CJB110}     SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_090}        SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_JM9130013}  SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_A909}       SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_1169NT}     SYIDKGELVP DqVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_2603}       SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_COH1}       SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_M732}       SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
msa252352.2{114_H36B}       SYIDKGELVP DeVTNGIVKE RLAEDDIAEK GFLLDGYPRT IEQAHALDAT
Consensus                   ********** *_******** ********** ********** **********

                            101                                                150
msa252352.2{114_18RS21}     LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_M781}       LEELGLRLDG VINIKVDPtC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_CJB110}     LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_090}        LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_JM9130013}  LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_A909}       LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_1169NT}     LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_2603}       LEELGLRLDG VINIKVDPSC LIERLSxRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_COH1}       LEELGLRLDG VINIKVDPtC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_M732}       LEELGLRLDG VINIKVDPtC LIERLSgRII NRKTGETFHK VFNPPVDYKE
msa252352.2{114_H36B}       LEELGLRLDG VINIKVDPSC LIERLSgRII NRKTGETFHK VFNPPVDYKE
Consensus                   ********** ********_* ******_*** ********** **********

                            151                                                200
msa252352.2{114_18RS21}     EDYYQREDDK PETVKRRLDV nIAQgepile hyrklglvtd iegnqeitev
msa252352.2{114_M781}       EDYYQREDDK PETVKRRLDV nIAQ
msa252352.2{114_CJB110}     EDYYQREDDK PETVKRRLDV nIAQgepile hy
msa252352.2{114_090}        EDYYQREDDK PETVKRRLDV nIAQgepile hyrklglvtd iegnqeitev
msa252352.2{114_JM9130013}  EDYYQREDDK PETVKRRLDV nIAQgepile hykklglvtd iegn
msa252352.2{114_A909}       EDYYQREDDK PETVKRRLDV nIAQgesile hyrklglvtd ieg
msa252352.2{114_1169NT}     EDYYQREDDK PETVKRRLDV hIAQgepile hysklglvtd iegnqei
msa252352.2{114_2603}       EDYYQREDDK PETVKRRLDV nIAQgepile hyrklglvtd iegnqeitev
msa252352.2{114_COH1}       EDYYQREDDK PETVKRRLDV nIAQgepile hyrklglvtd iegnqeitev
msa252352.2{114_M732}       EDYYQREDDK PETVKRRLDV nIAQgepile hyrklglvtd iegnqeitev
msa252352.2{114_H36B}       EDYYQREDDK PETVKRRLDV nIAQgesile hyrklglvtd iegnqeitev
Consensus                   ********** ********** _***



                            201        212
msa252352.2{114_18RS21}     fadvekalle
msa252352.2{114_M781}
msa252352.2{114_CJB110}
msa252352.2{114_090}        fadvekalle LK
msa252352.2{114_JM9130013}
msa252352.2{114_A909}
msa252352.2{114_1169NT}
msa252352.2{114_2603}       fadvekalle LK
msa252352.2{114_COH1}       fadvekall-
msa252352.2{114_M732}       fadvekalle LK
msa252352.2{114_H36B}       fadvekal--
Consensus                              **
SEQ ID NO. 4401 
STRAIN 2603 

GTGGATAAACATCACTCAAAAAAGGCTATTTTAAAGTTAACA 

CTT AT AACAACTAGT ATTTTATTAATG CATAG CAATCAAGTGAATGCAGAGG AG CAAGAA 
TTAAAAAACCAAGAGCAAT CAC CTGTAATTG CTAATGTTG CT CAACAGC CAT CG C CAT CG 
GT AACTACT AAT ACTGTTG AAAA^CATCTGTAACAGCTG CTT CTGCTAGTAATACAG CG 
AAAGAAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAGAG 
TT ATCTAAAAACCTTGATACGT CTAATTTGGGGG CTGAT CTTGAAGAAG AATAT C C CT CT 
AAACCAGAGACAACCAACAATAAAGAAAGCAATGTAGTAAC^^ 

G CACAG AAAGTTCC CT CAG CATATG AAGAGGTGAAG C CAGAAAG CAAGTCAT CG CTTG CT 
GTT CTTG ATACATCTAAAATAACAAAATTACAAG CCATAACCCAAAGAGGAAAGGGAAAT 
GTAGTAGCTATTATTGATACTGGCITTGATAT^ 

CCAAAAGATG AT AAG CACAG CTTTAAAACTAAGACAGAATTTGAGGA^TTAAAAG CAAAA 

CATAATATCACTTATGGGAAATGGGTTAACGATA^GATTGTTTTTGCACATAACT^ 

A^CAATACAGAAACG GTGG CTGAT ATTGCAGCAG CTATG AAAGATGGTTATG GTTCAGAA 

GCAAAGAATATTTCGCATGGTACACACGTTGOrcGTATTTTTGTA^ 

C CAG CAAT CAATGGT CITCrTTTAGAAGGTGCAG CG CCAAATGCT CAAGTCTTATT AATG 

CGTATTCCAGATAAAATTGATT CGGACAAATTTGGTGAAGCATATGCTAAAG CAAT CACA 

GACG CTGTT AAT CTAGG AG CAAAAACG ATTAATATGAGT ATTGGAAAAACAG CTG ATT CT 

TTAATTGCTCTCAATGATAAAGTTAAATTAGC^CTTAAATTAGCT 

G CAGTTGTTGTGG CTGC CGGAAATG AAGG CG CATTTGGTATGGATTATAG CAAAC CATT A 
TCAACTAATCCTGACTACGGTACGGTTAATAGTCCAGCT 

GTTG CT AG CTATGAAT CACT TAAAACT ATCAGTGAGGTCGTTGAAACAACTATTG AAGGT 
AAGTTAGTTAAGTTGCCGATTX3TGACITCTAAACCTTTTG 

GTGGTTTATG C CAATTATGGTG CAAAAAAAGACTTTG AAGGT AAGGACTTTAAAG GTAAG 
ATTG CATTAATTGAG CGTGGTGGTGGACTTGATTTTATGACTA CTACA 
AATGCAGGTGTTGTTGGTAT(^TTATTTTTAACGATCAAGAAAAACGTGGAAATT 
ATT C CTT AC CGTG AAT t AC CTGTGGGGATT ATTAGTAAAGTAGAT GG CGAG CGTATAAAA 
AATA CTTCAAGT CAGTTAACATTTAACCAGAGTTTTGAAGTAGTTGATAG CCAAGGTGGT 
AATCGTATGCTGGAACAATCAAGTTGGGGCGTGACAGCTGAAGGAGCAATCAAGCCTGAT 
G TAACAG CTTCTGG CTTTGAAATTTATT CTT CAAC CT AT AATAATCAATACCAAACAATG 
T CTG GTACAAGT ATG G CTT CAC CACATGTTGCAGGATTAATGACAATGCTTCAAAGT CAT 
TTGG CTGAG AAATATAAAGGGATG AATTTAGATT CTAAAAAATTG CTAGAATTGT CT AAA 
AACAT C CT CATGAG CT CAG CAACAG CATTATATAGTG AAG AGGATAAGG CGTTTT AT TCA 
CCACGTCAGCA^GGTGCAGGTGTAGTTGATGCTGAAAAAGCTATCCAAGCTCAATATTAT 
ATTACTGGAAACGATGGCAAAGCTAAAATTAATCTCAAACGAATGGGAGATAAATTT 
ATCACAGTTACAATTCATAAACTTGTAGAAGGTGTO^AAGA^ 

GTAG CAACAG AACAAGT AAATAAAG GTAAATTTG C CCTT AAAC CACAAG CCTTG CTAGAT 

ACTAATTGGCAGAAAGTAATTCTTCGTGATAAAGAAACACAAGTTCGATTTACrrAT^ 

GCTAGTCAATTTAGTCAGAAATTAAAAGAACAGATGGCAAA^ 

TTTGTACGTTTTAAAGAAGCCAAG GATAGTAATCAGGAGTTAATGAGTATT C CTTTTGT A 

GGATTTA^TGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATT^ 

T CT AAAG GTAGTTT CT ACTATAAAC CAAATGATACAACT CAT AAAGAC CAATTG G AGT AC 

AATGAATCAG CTCCTTTTGAAAGCAACAA CTATA CTGCCTTGTTAACACAAT CAG CGTCT 

TGGGGCTATGTTGATTATGTCAAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTCCA 

AAAAGAATTATTTTAGGAACTTTTGAGAATAAGGTTGAGGATAA 

GA*VAG AG ATGCAG CG AATAAT C CATATTTTGCCATTT CT C CAAA.T AAAGATGG AAAT AGG 
GACGAAATGACTCCCCAGGCAACTTTCTTAAGAAATC 

CTAGAT CAAAATGG AAATGTTATTTGG CAAAGTAAGGTTTTAC CAT CTT ATCGTAAAAAT 
TT CCATAAT AATCCAAAG CAAAGT GATGGTCATTATCGTATGGATGCTCTTCAGTGGAG T 
GGTTTAGATAAGGATGGCAAAGTTGTAGC^GATGGT^ 

ACACCAGTAG CAGAAGG AG CAAATAGTCAGGAGTCAGACTTT AAAGTACAAGTAAGTACT 
AAGT CAC CAAAT CTTCCTT CACGAG CT CAGTTTG ATGAAACT AAT CG AACATT AA.GCTT A 
GCCATGCCTAAGGAAAGTAGTTATGTTCCTACATATCGTTTACAA 
GTTGTAAAAGATGAAGAATATGGGGATGAGACTTCTTACCAT^ 

GAAG GTAAAGTG ACACTT C CTAAAACGGTT AAGATAGGAGAG AGTGAGGTTG CGGT AGAC 
CCTAAGGCCTTGACACTTGTTGTGGA^GATAAAGCTGGTAATTTCGCAACC^ 
TCTGATCT CTTG AAT AAGG CAGTAGT ATCAGAGAAAGAAAACG CTATAGTAATTT CT A*VC 
AGTTTGAAATATTTTGATAACTTGAAAAAAGAACCTATGTTTATTTCT 
" GT AGT AAACAAGAAT CTAG AAG AAATAAT ATT AGTTAAG CCG CAAACTACAGTTACT ACT 
CAATCATTGTCTAAAGAAATAACTAAATCAGGAAA 

AATAATAGTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGGGATTCrGTTAAC 
CATACCITACCTAGTACATCAGATAGAGCAACGAATGGTCTATTTGTTGGTACTTTGGCA 
TTGTTATCTAGTTTACTTCITTATTTGAAACCC 

SEQ ID NO. 4402 
STRAIN 090 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGTAATTGCT 

AATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATATTGTTGAAAA 

AACATCTGTAACAGCTGCTTCTGCTAGTAATACAGTGAAAGAAATGGGTG 

ATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAGAGTTA 

TCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGAAGAAGAATA 

TCCCTCTAAACCAGAGACAACCAACAA.TAAAGAAAGCAATGTAGTAACAA 

AT G CTTCAACTG CAATAGCACAGAAAGTT C CCTCAG CGTATGAAG AGGTG 

AAGCCAGAAAGCAAGTCATCGCTTGCTGTTTTTGATACATCTAAAATAAC 

AAAATTG CAAG C CAT AACC CAAAG AGG AAAGGG AAATGTAGT AG CTATT A 

TTGATACTGGCTTTGATATTAACCaTGATATTC 

AAAGATG ATAAG CACAG CTTTAAAACTAAAGCAGAATT CG AGG AATTAAA 
AGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAGATTGTTT 
TTGCACATAA.CTACGCCAACAATACAGA Z U\CGGTGGCTGATATTGCAGCA 
GCTATGAAAGATGGTTATGGGTCAGAAGCAAAGAATATTTCGCATGGTAC 
ACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAATCAATG 
GT CT T CTTTT AG AAG GTG CAG CG C CAAATGCTCAAGT CTTATTAATG CGT 
ATTCCAGATAAAATTGATT CGG ACAAATTTGGAG AAG CATATG CTAAAG C 
AATCACAGACGCTG t TAAT CT AGG AG CAAAAa CG ATT AATATGAG CCTTG 
GAAAAACAGCAGATTCTTTAAt t G C aCTCAATGATAAAGTTAAATTAG CA 
CTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGGCTGCCGGAAA 
TG AAGGTG CATTTGGTATGGATTATAG CAAAC CATTATCAACTAAT c CTG 
ACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAGATACTtTGAGTGTT 
GCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAACAACTAT 
TGaaGGTAAGTTAGTTAAGTTGCCGATTGTGACTTCrAAACCTTTtGACA 
AAGGTAAGGC CTACG ATGTGGTTTATG C CAATTATGGTG CAaAAAAAGAC 
TTTGAAGGTAAgGACTTTAAAGGTAAGATTGCATTAATtGAGCGTGGtGG 
TGGACITGATTTTATGACTAAaa t CACTcATGCTACAAATGCAgGTGTTG 
t TGGTaTCGT tATT 1 1 1 AACgAt CAAGAa aAACG t GGAAATTTTcTAATT 
CCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAGTAGATGGCGAGCG 
TATAAAAAATACTT CAAGT CAGTTAACATTTAAC CAG AGTTTTg AAGTAG 
TTGATAGCCAAGGTGGCAATCGTATGCTGGAACAATCAAGTTGGGGCGTG 
ACAGCTGAAGGAGCAATGAAGCCTGATGTAACAGCTTCTGGCrTTGAAAT 
TTATT CTTCAAC CTAT AAT AATCAATAC CAAACAATGT CTGGTACAAGT A 
TGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAAGTCATTTG 
GCTGAGAAATATAAAGGGATGAATTTAgATTCTAAAAAATTGCTAGAATT 
GTCTAaAAACATCCTCATGAGCTCAGCaaCAGCATTATATAGTgAAGAgG 
ATAAGGCGTtTtATTCaCCACGTCAGCAAGGtGCAGGtGTAGTTGATGCT 
GAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGGCAAAGC 
TAAAATTAAT CT CAAACGAGTGGGAGATAAATTTG ATATCACAGTTAC7VA 
TTCATAAACTTGTAG AAGGTGT CAAAG AATTGTAT TATCAAG CTAATGT A 
GCAACAGAACaAGTAAATAAAGGTAAATTTGCCCTTAAACCACAAGCCtT 
GCTAGATACTAATTGGCAGAaAGTAATTCTTcGTGATAAAGAAACACAAG 
TTcGATTTACTATTGATGCTAGTCAATTTAGTCAGAAATTAAAAGAACAG 
ATGG CAAATGGTTATTT CTTAg AAGGTTTTGTACGTTTT AAAGAAG CCAA 
GGATAGt AATCAGGAGTTAaTGAGTATTCCTT tTGTAGGATt t AATGGTG 
ATTTTGCGAACTTACAAGCACTTGAAACACCGATTTATAAGACGCTTTCT 
AAAGGTAGTTTCTACTATAAACCAAATGATACAACTCATAAAGACCAATT 
GGAGTACAATGAAT CAG CT CCTTTTGAAAG CAACAACTAT ACTG C CTTGT 
TAAGACAATCAGCGTCITGGGGCTATGTTGATTATG 

GAGTTAG AATTAG CACCGG Ag AGTc CAAAAAGAATTATTTTAgG AACTTT 

TGAGAATAAGGTTGAGGATAAAACAATTCATCTTTTGGAAAGAGATGCAG 

Cg AAT AAT CCAT ATTTTG C CATTT CTC CAAATAAAGATGG AAATAGG GAT 

GAAATCACTCCCC^GGCAA(2TTTCTTAAGAAATGTTAAGGATATTTCTGC 

TCAAGTT CTAGATCAAAATGGAAATGTTATTTGG CAAAGT AAGGTTTTAC 

CATCTTATCGTAAAAATTT C CAT AATAATC CAAAG CAAAGTGATGGT CAT 

TATCGTATGGATGCCTTTCAGTGGAGTGGTTTAGATAAGGATGGCAAAGT 

TGTAGCAGATGGTTTTTATACTTATCGCCTACGTTAC^CACCAGTAGCA^ 

AAGGAGCAAATAGTCAGGAGTCAGACITTAAAGTTCAAGTAAGTACTAAG 

TCACCAAAT CTT C CTTTACTAGCT CAGTTTGATG AAACT AAT CGAACATT 

AAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCTAC^TATCGTTTAC 

AATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGGATGAGACT 

TCTTACCATTATTTCCATATAGATCAAGAAGGTAAAGTGACACTTCCTAA 

AACGGTT AAGAT AGG AG AG AGTGAGGTTG CAGTAGAC CCTAAGG C CTTGA 

CACTTGTTGTGG AAGAT AAAG CTG GTAATTTTG CAACGGTAAAATTGTCT 

GACCT CTTG AATAAGGCAGTAGTAT CAGAG AAAG AAAACG CTATAGT AAT 

TTCTAACAGTTTCAAATATTTTGATAACTTGAAAAAAGAATCTA 

TTTCTAAAGAAGGAAAAGTAGTAAACAAGAATCTAGAAGAAATAACATTA 

GTTAAG C CG CAAACT ACAGTTACTACT CAATCATTGT CTAAAGAAATAAC 

TAAATCAGGAAATGAGAAAGTCCTCACTTCTACAAACAATAATAGTAGCA 

GAGTAGCTAAGATCAT AT CACCT AAACATAACGGGGATT CTGTT AAC CAT 

ACC 

SEQ ID NO. 4403 
STRAIN A909

GAGGAGCAAG AATTAAAAAACCAAGAG CAAT 

CACCrGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACT 
AATACTGTTGAAAAAACATCTGTAACATCTGCTT CTG CTAGT AAT ACAG C 
GAAAGAAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAAT 
TATTAGAAG AGTTAT CTAAAAAC CTTG ATA CGTCTAATTTGGGGG CT GAT 
CTTG AAG AAG AATATC CCT CTAAAC CAGAG ACAAC CAACAAT AAAG AAAG 
CAATGTAGTAACAAATGCTTCAACTGCAATAGCACAGAAAGTTCCCTCAG 
CATATGAAGAGGTGAAGCCAGAAAGC^^GTCATCACTTGCTGTTCTTGAT 
ACAT CTAAAATAACAAAATTG CAAG CCATAAC CCAAAGAG GAAAGGG AAA 
TGTAGTAGCTATTATTGATACrGGCTTTGATATTAACCATGATATTTTTC 
GTTTAGATAGCCCAAAAGATgaTAAGCACAGCTTTAaAACTAAGGCAGAA 
TTTGAGGAATTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAA 
CGATAAGATTGtTTTTGCACATAACTACGCCAaCAATACAGAAACGGTGG 
CTGAT ATTGCAG CAG CT ATG AAAGATGGTT ATGGGTCAGAAG CAAAG AAT 
ATTTCGCATGGTAC7VCACGTTGCTGGTATTTTTGTAGGTAATAGTAAACG 
TCCAGCAATCAATGGTCITCTTTTAGAAGGTGCAGCGCCAAATGCTCAAG 
T CTTATTAATG CGT ATTCCAG ATAAAATTGATTCGGA CAAATTTG GTGAA 
G CATATG CTAAAG CAATCACAGACG CTGTT AATCT AG GAG CAAAAACGAT 
TAAT ATG AG C CTTGG AAAAACAG CAGATT CTT TAATTG C T CT CAATG AT A 
AAGTTAAAT TAG CACTTAAATT AG CTT CTG AG AAG GG CGTTG CAGTTGTT 
GTGG CTG C CGGAAATG AAGGTG CATTTGGTATGGATTATAG CAAAC CATT 
ATCAACTAATCCTGACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAG 
ATACTTTGAGTGTT G CT AG CTATGAATCAC TTAAAACTAT CAGTGAGGTC 
GTTGAAACAACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTC 
TAAACCTTtTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATG 
GTGCAAAAAAAAGACTTTGAAGGTAAGGACTTTAAAGGTAAGATTGCATT
AATTGAG CGTGGTGGTGGACTTGATTTTATGACT AAAAT CACT CATG CTA 
CAAATG CAGGTGTTGTTGGTAT CGTTATTTTTAACGAT CAAG AAAAACGT 
GGAAATTTTCTAATTCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAA 
AGTAGATGG CGAG CGTATAAAAAAT ACTT CAAGT CAGTTAACATTTAACC 
AGAGTTTTGAAGTAGTTGATAGCCAAGGTGGCAATCGTATGCTGGAACAA 

TCAAGTTGGGGCGTG ACAGCTGAAGGAG CAAT CAAG C CTG ATGTAACAG C 
TTCTGGCTTTGAAATTTATTCTTCAACOTATAATAATCAATACCAAACAA 
TGTCTGGTACAAGTATG G CTT CACCACATG t TGCAGG ATTAATGACAATG 
CTTCAAAGT CATTTGG CTGAGaAATATAAAGGGATGAATTTAGATT CTAA 
AAAATTGCT AGaATTGT CTAAAAACAT c CT CATG AG CTCAGCAACAG CAT 
TATATAGTGAAGAGGATAAGGCGTTTTATTCACCACGTCAGCAAGGTGCA

GGTGT AGTTGAT G CTGAAAAAG CTATCCAAG CTCAAT ATT ATGTTACTGG 
AAACG ATGG CAAAG CTAAAATT AAT CT CAAACGAGTG GG AGATAAATTTG 
ATATCACAGTTACAATTCATAAACTTGTAGAAGGTGTCAAAG AAT TGTAT 
TAT CAAG CT AAT GT AG CAACAG AACAAGTAAAT AAAGGTAAATTTGCCCT 
TaAACCaCAAGCCTTGCTAGATACTAATTGGCAGAAAGTAATTCTTcGTG 
ATAAAGAAACACAAGTTCGATTTACTAtTGATTCTAGTCAATTTAGTCAG 
AAATTAAAAGAACAGATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACG 
TTTTAAAGAAG C CAAGG AT AGT AATCAGGAGTTAATG AGT ATT CCTTTTG 
TAGGATTTAATGGTGATTTTGCGAACTTACAAGCACTTGAAACACCGATT 
TATAAGACG CTTTCT AAAGGTAGTTTCTACTATAAAC CAAATGATACAAC 
T CATAAAGACCAATTGG AGT ACAATGAATCAG CT C CTTTTGAAAG CAACA 
ACTATACTG C CTTGTTAACACAATCAG CGT CTTGGGG CT ATGTTG ATTAT 
GTCAAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTCCAAAAAGAAT 
TATTTTAGGAACITTTGAGAATAAGGTTGAGGATAAAACAATTC^ 
TGGAAAGAGATG CAG CGAAT AAT CCAT ATTTTGCCATTT CTCCAAATAAA 
GATGGAAATAGGGATGAAATCACTCCCCAGGCAACTTTCTTAAGAJ\ATGT 
TAAGGATATTTCTG CTCAAGTTCTAGAT CAAAATGGAAATGTTATTTGGC 
AAAGTAAGGTTTTAC CATCTTATCGTAAAAATTTC CATAATAAT C CAAAG 
CAAAGTGATGGTO^TTATCGTATGGATGCCCTTCAGTGGAGTGGTTTAGA 
TAAGGATGGCAAAGTTGTAGCAGATGGTTTTTATACTTATCGTTTACG'rT 
ACACACCAGTAG CAG AAGG AG CAAAT AGT CAGGAGTCAG ACTTT AAAGTT 
CAAGTAAGT ACTAAGT CAC CAAAT CTT C CTTCACGAG CT CAGTTTGATG A 
AA CT AAT CGAACATT AAG CTTAGC CATG C CTAAGG AAAGT AGTTATGTTC 
CT ACATATCGT CTACAATTAGTTTT AT CT CATGTTGTAAAAG ATG AAGAA 
TATGGAG ATG AGACTT CTT AC CATT ATTT CCATAT AG ATCGAGAAGGTAA 
AGTGACACTT CCTAAAACAGTT AAGATAGG AGAG AGTGAGGTTG CAGTAG 
ACCCTAAGACCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGCA 
ACGGTAAAATTGTCTGAC CT CTTGAAT AAGG CAGTAGTAT CAGAG AAAG A 
AAACGCTATAGTAATTTCTAACAATTTCAAATATTTTGATAACTTGA 
AAGAAC CTATGTTT ATTT CTAAAG AAG GAAAAGTAGT AAA CAAG AAT CTA 
GAAGAAATAGCATTAGTTAAGCCGCAAACTACAGTTACTACTCAATCATT 
GT CTAAAGAAAT AACT CAATCAGG AAATG AG AAAGT C CT CACTT CT ACAA 
ACAATAATAGTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGG 

GATTCTGTTAACCATACC 

SEQ ID NO. 4404
STRAIN H36B 

GAGGAGCAAG AATTAAAAAAC CAAGAG CAAT CACCTGT AATTG C 

TAATGTTGCTCAACAGCGATCGCCATCGGTAACTACrAATACTGTTGAAA 

AAACATCTGTAACATCTGCTTCTGCTAGTAATACAGCGAAAGAAATGGGT 

GATACAT CTGTAAAAAATGACAAAACAGAAGATG AATTATTAGAAGAGTT 

ATCTAAAAAC CTTG AT ACGTCTAATTT GG GGG CTG AT CTTGAAGAAGAAT 

AT CC CTCTAAAC CAG AG ACAAC CAACAATAAAGAAAG CAATGTAGTAACA 

AATG CTT CAACTGCAAT AGCACAG AAaGTT CCCT CAG CAT ATGAAGAGGT 

aAAGCCAGAAAGCAAGTCATCACTTGCTGTTCITGATACATCTAAAATAA 

CAAAATTGCAAGCCATAAC C CAAAG AGGAAAGGGAAATGT AG TAG CT ATT 

ATTGATACTGGCITTGATATTAACCATGATATTTTTCGTTTAGATAGCCC 

AAAAGATGATAAGCACAGCTTTAAAACTAAGGCAGAATTTGAGGAATTAA 

AAGCAAAACATAATATCACTTATGGGAAATGGGTTA^CGATAAGATTGTT 

TTTGCACATAACTACGCCAaCAATACAGAAACGGTGGCTGATATTGCAGC 

AGCTATGAAAGATGGTTATGGGTCAGAAGCAAAGAATATTTCGCATGGTA 

CACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAATCAAT 

GGTCTTCTTTTAGAAGGTGCAGCGCCAAATGCTCAAGTCTTATTAATGCG 

TATTCCAGATAAAATTGATTCGGACAAATTTGGTGAAGCATATGCTAAAG 

CAAT CACAGACG CTGTT AAT CT AG GAGCAAAAACG ATTAATATGAG C CTT 

GGAAAAACAGC^GATTCTTTAATTGCTCrr(^TGATAAAGTTAAATTAGC 

ACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGGCTGCCGGAA 

ATGAAGGTG CATTTG GT ATGGATT ATAG CAAAC CATT AT CAACT AATCCT 

GACTACGGTACGGTTAATAGTCCAGCrATTTCTGAAGATACTTTGAGTGT 

TGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAACAACTA 

TTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTCTAAACCTTtTGAC 

AAAGGTAAGGCCTACCATGTGGTTTATGCCAATTATGGTGCAAAAAAAGA 

CTTTGAAGGTAAGGACTTTAAAGGTAAGATTGCATTAATTGAGCGTGGTG 

GTGGACrTGATTTTATGACTAAAATCACTCATGCTACAAATGCAGGTGTT 

GTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTTCTAAT 

TCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAGTAGATGGCGAGC 

GTATAAAAAATACTTCAAGTCAGTTAACATTTAACCAGAGTTTTGAAGTA 

GTTGATAGCCAAGGTGGCAATCGTATG CTGGAACAAT CAAGTTGGGGCGT 

G A CAG CT CAAGG AG CAAT CAAG CCTGATGTAA CAG CTTCTGGCTTTG AAA 

TTTATTCTT C AACCT AT AATAAT CAAT AC C AAACAAT GT CTGGTACAAGT 
ATGG CTT CAC CACATGTTG CAG GAT T AATG ACAATG CTT CAAAGTCATTT 
GGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAATTGCTAGAAT
TGT CTAAAAACATC CT CAT GAGCT CAG CAACAGCATTATATAGTGAAGAG 
GATAAGGCGTTTTATTCACCACGTaAGCAAGGTGCAGGTGTAGTTaATGC
TGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGGCAAAG 
CTAAAATTAATCTCAAACGAGTG^GAGATAAATTTGATATCACAGTTACA
ATTCATAAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCTAATGT 
AGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCTTAAACCaCAAGCCT 
TG CTAGATACTAATTGG CAG AAAGTAATT CTT CGTGATAAAGAAACACAA 
GTTCGATTTACTATTGATTCTAGTCAATTTAGTCAGAAATTAAAAaAACA 
GATGGCAAATGGTTATTTCTTAGAAGGTTTTG t ACGTTTTAAAGAAGCCA 
AGGATAGTAAT CAGG AGTTAATGAGT ATT C CTTTTGT AGGATTT AAT GGT 
GATTTTG CG AACT t ACAAG CACTTG AAACACCG ATTTAT AAG ACG CTTT C 
TAAAGGTAGTTT CT ACT AT AAACCAAATG AT ACAACT CATAAAG ACCAAT 
TGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATACTGCCTTG 
TTAACACAATCAGCGTCTTGGGGCTATGTTGATTATGTCAAAAATGGTGG 
GGAGTTAgAATT Ag CAC CG G AG AGT CCAAAAAaAATTATTTT AGG AACTT 
TTGAGAATAAGGTTG AGGATAAAACAATT CATCTTTTGG AAAGAG ATGCA 
GCGAATAAT CCATATTTTG C GATTTCT COAAATAAAG ATGGAAAT AG GGA 
TGAAAT CACTCC CCAGG CAACTTT CTT AAGAAATGTTAAGGATATTT CTG 
CTCAAGTTCT AG AT CAAAATGG AAATGTTATTTGG CAAAGTAAGGTTTT A 
CCAT CTTAT CGTAAAAATTT C CAT AAT AATCCAAAG CAAAGTGATGGTCA 
TTATCGT ATGGATG C C CTT CAGTGG AGTGGTTT AGATAAGGATGG CAAAG 
TTGTAGC^GATGGTTTTTATACITATCGTTTACG 

GAAGGAGCAAATAGT(^GGAGTCAGACITTAAAGTTCAAGTAAGTACTAA 
GT CACCAAATCTTC CTT CACGAG CT CAGTTTG ATGAAACTAAT CG AACAT 
TAAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCTACATATCGTCTA 
CAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGAGATGAGAC 
TTCTTACCATTATTTCCATATAGATCAAGAAGGTAAAGTGACACTTCCTA 
AAACAGTTAAGATAGaAGAGAGTGAGGTTGCAGTAGACCCTAAGACCTTG 
ACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGCAACX3GTAAAATTGTC 
TGACCTCTTGAATAAGGCAGTAGTATCAGAGAAAGAAAACGCTATAGTAA 

TTTCTAACAATTTCAAATATTTTGATAACTTGAAAAAAGA 
ATTTCTAAAGAAGGAAAAGTAGTAAACAAGAATCTAGAAGAAATAGCATT 
AGTT AAG CCG CAAAC TACAGTTACT ACTCAATCATTGT CTAAAGAAATAA 
CTCAATCAGGAAATGAGAAAGTCCTCACTTCTACAAACAATAATAGTAGC 
AGAGTAG CT AAG AT CAT AT CAC CT AAACATAACG G GGATT CTGTTAACCA 
TACC 

SEQ ID NO. 4405 
STRAIN 18RS21 

GAGG AG CAAG AATT AAAAAA C C AAGAG CAAT CAC C 

TGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATA 
CTGTT GAAAAAACAT CTGTAACAG CTG CTT CTG CT AGTAATACAG CGAAA 
GAAATGG GTGATACAT CTGTAAAAAATGACAAAACAG AAGATGAATTATT 
AGAAGAGTTATCTAAAAACCITC^TACGTCTAATTTGGGGGCTGATCTTG 
AAGAAGAATATC CCTCT AAAC CAG AGACAACCAACAATAAAGAAAGCAAT 
GTAGTAACAAATG CTTCAACTG CAAT AGCACAGAAAGTT CCCTCAG CATA 
TGAAGAGGTGAAG C CAGAAAG CAAGTCAT CGCTTG CTGTT CTTGATACAT 
CTAAAATAACAAAATT ACAAG C CAT AACCCAAAGAGGAAAGGGAAATGTA 
GTAGCTATTATTGATACTGGCTTTGATATTAACCATGATATTTTTCGTTT 
AGATAGCCaAAAAGATGATAAGCACAGCTTTAAAACTAAGACAGAATTTG 
AGGAATTAAAAG CAAAACATAATAT CACTTATGGGAAATGGGTTAACGAT 
AAGATTGTTTTTGCACATAACTACGCCAACAATACAGAAACGGTGGCTGA 
TATTG CAGCAG CTATGAAAGATGGTTATGGTT OA.GAAG CAAAG AAT ATTT 
CGCATGGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCA 
GCAATCAATGGTCTTCTTTTAGAAGGTGCAGCGCCAAATGCTCAAGTCTT 
ATTAATG CGTATTC CAG AT AAAATTGATT CGG ACAAATTTGGTGAAG CAT 
ATGCTAAAG CAAT aAOAGACGCTGTT AAT CTAGGAGCAAAAACG ATT AAT 
ATGAGTATTGGAAAAACAGCTGATTCTTTAATTGCTCTCAATGATAAAGT 
TAAATTAGCACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGG 
CTGCCGGAAATGAAGGCGCATTTGGTATGGATTATAGCAAACCATTATCA 
ACTAAT C cTG ACTACGGT ACGGTT AATAGTCCAG CTATTT CTG AAGAT AC 
TTTGAGTGTTG CTAG CT ATG AATCACTTAAAACT AT CAGTG AGGTCGTTG 
AAACAACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTCTAAA 
CCTTTTGACAAAGGTAAGGC CTACG ATGTG GTTTATG C CAATTATGGTG C 
AAAAAAAG ACTTTG AAGGT AAGGACTTT7VAAG GTAAG AT TG CATT AATTG 
AGCGTGGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAAT 
GCAGGTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAA 
TTTTCTAATT CCTTACCGTGAATT AC CTGT GGGG ATT ATT AgTAAAGT AG 
ATGG CGAGCGTATAAAAAATACTT CAAGT C AGTT AAC ATT t AACCAg AGT 
TTTG AAG t AGTT GATAGCCAAGGTGGtAATCGTaTGCTGGAACAATCAAG 
TTGGGGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTG 
G CTTTGAAATTT ATT CTTCAAC CT AT AATAAT CAATAC CAAa CAATGT CT 
GGTACAAGTATGGCTT CAC CACATGTTG CAGG ATT AATGACAATG CTTC A 
AAGTCATTTGGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAAT 

tgctagaattgtctaaaaacatcctcatgagctcagcaacagcattatat 
agtgaagaggataaggcgttttattcaccacgtcagcaaggtgcaggtgt 
agttgatgctgaaaaagctatccaagctcaatattatattactggaaacg 
atgg caaag ctaaaatt aat ct caaacgaatg g g ag ataaatttg atat c 
a(^gttacaattcataaacttgtagaaggtgtcaaagaattgtattatca 
agcraatgtagcaacagaacaagtaaataaaggtaaatttgcccttaaac 
cacaagccttgctagatactaattggcagaaagtaattcttcgtgataaa 
gaaacacaagttcgatttactattgatgctagtcaatttagtcagaaatt 
AAAAGAACAGATGGCAAATGGTTATTTCITAgAAGQTTTTGTACGTTTTA
AAGAAGCCAAGGATAGTAATCAGGAGTTAATGAGTATTCCTTTTGTAGGA 
TTTAATGGTGATTTTGCGAACTTACAAGCACT^ 

GACGATTTCT AAAGGTAGTTT CTACTATAAACCAAATGATACAACT CAT A 
AAGACCAATTGGAGTACAATGAATCAG CT C CTTTTGAAAG CAACAACTAT 
ACTG CCTTGTTAACACAAT CAG CGT CTTG G GG CTATGTT G ATT ATGT CAA 
AAATGGTGG GGAGTT AGAATTAGC a CCGG AGAGT C CAAAAAGAATTATTT 
TAGGAACTTTTGAGAATAAGGTTGAGGATAAAACAATTCATCTTTTGGAA 
AGAGATG CAGCGAAT AAT C CAT ATTTTG CCATTTCT C CAAAT AAAGATGG 
AAATAGGGACGAAAT CACTC CCCAG G CAAC t TTCTTAAG AAATGTT AAG G 
ATATTTCTGCTCAAGTTCTAGATCAAAATGGAAATGTTATTTGGCAAAGT 
AAGGTTTTACCATCTTATCGTAAAAATTTCCATAATAATCCAAAGCAAAG 
TGATGGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAGATAAGG 
ATGGCAAAGTTGTAGCAGATGGTTTTTATACTr^^ 

C CAGT AG CAGAAGG AGCAAATAGT CAG GAGTCL^G ACTTT AAAGT ACAAGT 
AAGTACT AAGT CAC CAAAT CTT C CTTCACGAG CT CAGTTTGATGAAACT A 
AT CGAACATT A^G CTTAG CCATG CCTAAGGAAAGT AGTTATGTTC CTACA 
TATCGTTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGG 
GGATG AGACTT CTTAC CATTATTT C CATATAG AT CAAGAAGGTAAAGTGA 
CACTT CCTAAAACGGTT AAG ATAGG AG AG AGTGAGGTTG CGGTAGACCCT 
AAGGCCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGcAACGGT 
AAAATTGT CTGAT CT CTTGAAT AAGGCAGTAGTAT CAGAG AAAG AAAACG 
CTATAGTAATTT CTAACAGTTTC!AAATATTTTGAT AACTTGAAAAAAGAA 
C CTATGTTTATTTCTAAAAAAG AAAAAGT AGT AAACAAG AAT CT AGAAG A 
AATAAT ATT AGTTAAGC CG CAAACTACAGTTACT ACT CAAT CATTGT CT A 
AAGAAATAACTAAAT CAGG AAATGAGAAAGT C CTCACTT CTACAAACAAT 
AATAGTAGCAGAGTAGCTAAGATCATATCACCTAAACATAACGGGGA'rrC 

TGTTAAC CAT AC C 

SEQ ID NO. 4406 
STRAIN M732 

GAGGAGCAAGAATTAAAAAACCAAGAG CAATCAC CT 
GTAATTG CTAATGTTG CT CAACAG CCAT CG CCAT CGGT AACTACT AATAT 
TGTTG AAAAAACAT CTGTAACAGCTGCTT CTG CTAGT AAT ACAGTGAAAG 
AAATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGA^TTATTA 
GAAGAGTTATCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGA 
AGAAG AATAT C C CT CTAAACCAGAG ACAAC CAACAAT AAAG AAAG CAATG 
TAGTAACAAATGCTT CAACTG CAATAG CACAGAAAGTTC C CT CAG CAT AT 
GAAGAGGTG AAGT CAGAAAG CAAGT CAT CG CTTGCTGTT CTTGATACAT C 
TAAAATAACAAAATTAC^GCCACAACCCAAAGAGGAAAGGGAAATGTAG 
TAGCT ATTATTGATACTGG CTTTGATATTAAC CATGATATTTTTCGTTT A 
GATAGCCOU^AGATGATAAGCACAGCTTTAAAACTAAGGCAGAATTTGA 
GGAATTAAAAG CAAAACATAAT ATCACTTATG GG AAATGGGTTAACG AT A 
AG ATTGTTTTTG C!ACATAACTACGC CAACAAT A CAGAAACGGTGG CTGAT 
ATTG CAG CAG CTATGAAAG ATG GTTATGGGTCAG AAG CAA^GAAT ATTTT 
G CATGGTACACACX3TTG CT GGTATTTTTGTAGGTAATAGTAAACGT CCAG 
CAAT CAATAGTCTT CTTTTAGAAGGTG CAGCG C CAAATG CT CAAGT CTTA 
TTAATGCGTATTCCLA.GATAAAATTGATTCGGACAAATTTGGAGAAGCATA 
TG CTAAAG CAATCAT AGACG CTGTTAAT CTAGGAG CAAAAACGATTAATA 
TGAG CCTGGGAAAAACGGCTGATT CTTTAATTGCT CTCAATGAT AA^GTT 
AAATTAG CACTTAAATT AG CTT CTGAG AAGGG CGTTG CAGTTGTTGTGG C 
TG CCGGAAATGAAGGTG CATTTGGTATGGATT ATAG CAAACCATT ATCAA 
CTAATCCTGACTACGGT ACGGTTAATAGTC CAG CTATTT CTGAAGAT ACT 
TTGAGTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGA 
AACAACTATTGAAGGTAAGTTAGTTAAGTTGC CG AITGTGAC'IT CTAAAC 
CTT t TGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCA 
AAAAAGATTTTGAAGGTAAGGACTTTAAAGGTAAGATTGCATTAATTGAG 
CGTGGTGGTGGACTTGATTTTATGACTAAAAT CACTCATG CTACAAATG C 
AGGTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATT 
TT CT AATTC CTTACCGT G AATTACCTGTGGGGGTT ATTAGTAAAGT AGAT 
GG CGAG CGTATAAAAAATACTT CAAGTCAGTTAACATTT AAC CAG AGTTT 
TGAAGTAGTTGATAG CCAAGGTGG CAATCGTATG CTGGAACAAT CAAGTT 
GGGGCGTGA CAG CTGAAGGAG CAAT CAAGC CTTG ATGT AA CAG CTT CTGGC 
TTTGAAATTTATTCTTCAAC CT AT AAT AAT CAAT ACT AAACAATGTCTGG 
TACAAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCTTCAAA
GTCATTTG^CTG AG AAATAT AAAG GGATG AATTTAGATT CTAAAAAATTG 
CT AG AATTGT CTAAAAACAT C CTCATG AG CTCAG CAACAG CATT ATATAG 
TGAAGAGGATAAGGCGTTTTATTCACCACGTCAGCAAGGTGCAGGTGTAG
TTGATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGAT 
GG CAAAGTT AAAATT AAT CT CAAACGAGAGGG AG ATAAAT TTG AT ATCAC 
AGTTACAATTCATaAACITGTAGAAGGTGTCAAAGAATTGTATTATCAAG 
CTAATGT AG CAACAGAaCAAGTAAATAAAG GTAAATTTG C C CTT aAACCA 
CAAG C CTTG CT AGAT ACT AATTGG CAGAAAGT AATTCTT CGTG AT AAAG A 
AACA CAAGTT CGATTTACTATTGAT G CTAGT CAATTT AGT CAGAAATTAA 
AAGAACAGATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACGTTT^ 
GAAGCCAAGGATAGTAATCAGGAGTTAATGAGTATTCCTTTTGTAGGATT 
TAATGGTGATTTTGCGAACTTACAAGCACTTGAAACaCCGATTTATAAGA 
CGCTTTCT AAAG GTAGTTT CTACTATAAAC CAAATG ATACAACT CAT AAA 
GACCAATTGGAGTAC^TGAATCAGCTCCTTTTGAAAGCAACAACTATAC 
TGCCTTGTTAACACAATCAGCGTCTTGGGGCTATGTTGATTATGTCAAAA 
ATGGTGG GG AGTTAG AATTAG CAC CGG AG AGT C CAAAAAG AATTATT TTA 
GGAACrrTTTGAGAATAAGGTTGAGGATAAAACAATTCATCTTTTGGAAAG 
AG ATG CAG CG AAT AATC CAT ATTTTGC CATTT CT C CAAAT AAAG ATGGAA 
AT AGGGACGAAAT CACT C C C CAGG CAACTTTCTT AAGAAATGTTAAGGAT
ATTT CTG CTCAAGTT CTAGAT CAAAATGG AAATGTTATTTGG CAAAGT AA 
GGTTTTAC CATCTT ATCGT AAAAATTT C CATAAT AAT CCAAAG CAAAGTG 
ATGGTGATTATCGTATGGATGCTCrTCAGTGGAGTGGTTTAGATAAGGAT 
GGCAAAGTTGTAGCAGATGGTTTTTATACTTATCGCITACGTTACACACC 
AGTAG CAGAAGG AG C aAATAGTCAGGAGT CAG ACTTTAAAGTT CAAGTAA 
GTACTAAGTCACCAAATCTTCCTTCACGAGCTCAGTTTGATGAAACTAAT 
CGAACATTAAG CTT AGC CATG C CTAAGGAAAGTAGTTATGTT CCT ACAT A 
TCGTTTACAATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGG 
ATG AG ACTT CTT AC CATTATTT CCATATAGAT CAAGAAGGTAAAGTG ACA 
CTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTAGACCCTAA 
GGCCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTTGCAACGGTAA 
AATTGTCTG AC CTCTTG AAT AAGG CAGTAGTAT CAGAGAAAG aAAACGCT 
AT AGT AATTT CTAACAGTTT CAAATAT TTTGATAACTTG AAGAAAGAACC 
TATGTTTATTTCTAAAG AAGGAAAAGT AGT AAACAAGAAT CT AG AAG AAA 
TAACATTAGTTAAG C CT CAAACTACAGTTACTACTCAAT CATTGT CTAAA 
GAAATAACTAAATCAGGAAATG AG AAAGT C CT CACTTCTACAAACAATAA 
TAGT AGCAGAGTAG CTAAG ATCATAT CAC CTAAACATAACGGGGATTCTG 
TTAACCATACC 

SEQ ID NO. 4407 
STRAIN COK1 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGT 
AATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTaACTACTAATATTG 
TTGAAAAAACAT CTGTAACAG CTG CTT CTG CTAGTAATACAGTG AAAGAA 
ATGGG t gAT ACAT CTGTAAAAAATGACAAAACAGAAGATGAATTATT AGA 
AGAGTTATCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGAAG 
AAGAATATC C CT CT AAAC CAGAGaCAACCAACAATAAAGAAAGCAATGT A 
GT AACAAATG CTT CAACTGCAATAG CACAG AAAGTTCCCT CAGCAT ATG A 
AGAGGTGAAGTCAGAAAGCMGTCATCGCTTGCTGTTCTTGATACATCTA 
AAATAACAAAATTACAAG C CACAACC CAAAGAGG AAAGG G AAATGTAGTA 
GCTATTACTGATACTGGCTTTGATACTAACCATGATATTTTTCGTTTAGA 
TAG CC CAAAAGATG AT AAG CACAG CTTTAAAACT AAGGCAGAATTTG AGG 
AAtTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAG 
ATTGTTTTTGCACATAACTACGCCAaCAATACAGAAACGGTGGCTGATAT 
TG CAG CAG CT ATGAAAG ATGGTTATGGGT CAGAAG CAAAG AATATTTTG C 
ATGGTACACACGCTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCA 
ATCAATAGTCTT CTTTT AG AAG GTG CAG CG CCAAATG CT CAAGT CTTATT 
AATGCGTATTCCAGATAAAATTGATTCGGACAAATTTGGAGAAGCATATG 
CTAAAGCAATCATAGACGCTGTTAATCTAGGAGCAAAAACGATTAATATG 
AGCCTGGGAAAAACGGCTGATTCTTTAATTGCTCTCAATGATAAAGTTAA 
ATTAG CACTTAAATTAG CTTCT GAGAAGGG CGTTG CAGTTGTTGTGG CTG 
CCGGAAATGAAGGTG CATTTGGTATGG ATTATAG CAAAC CATTAT CAACT 
AAT CCTG ACTACGGT ACGGTTAAT AGT CCAGCTATTT CTG AAGAT ACTT T 
GAGTGTTG CTAG CT ATG AAT (^CTT AAAACTATCAGTGAGGT CGTTGAAA 
CAACTATTGAAGGT AAGTTAGTTAAGTTG C CGATTGTGACTT CTAAAC CT 
TtTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAA 
AAAG ATTTTG AAGGT AAGG ACTTTAAAGGTAAGATTG CATTAATTGAG CG 
TGGTGGTGG ACTTG A'iTTT ATG ACTAAAATCACTCATG CTACAAATG CAG 
GTGTTGTTGGTATCGTT ATTTTTAACGAT CAAGAAAAACGTGGAAATTTT 
CTAATTCCTTACCGTGAATTACCTGTGGGGGTTATTAGTAAAGTAGATGG 
CGAG CGTATAAAAAATACTTCAAGTCAGTTAACATTTAAC CAGAGCTTTG 
AAGTAGTTGATAGCCAAGGTGGCAATCGTATGCTGGAACAATCAAGTTGG 
GGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTT 
TGAaATTTATTCTTC^CCTATAATAATCAATACTAAACAATGTCTGGTA 
CAAGTATGGCTTC^CCACATGTTGCAGGATTAATGACAATGCTTCAAAGT 
CATTTGGCTGAGAAATATAAAGGGATGAATTTAGATTCTAaAAAATTGCT 
AG aATTGT CTAa aAACAT C CT CATGAG CT CAG CAACAGCATTATATAGTG 
AAGAGGATAAGG CGTTTTATTCAC CACGT CAGCAAGGTGCAG GTG TAG TT 
GATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGG 
CAAAGTTAAAATTAAT CT CAAACG AG AGGG AGATAAATTTG ATAT CACAG 
TT ACAATT CATaAACTTGTAG AAGGTGT CAAAGAATTGTATTAT CAAG C T 
AATGTAGCAaCAGAACAAGTAAATAAAGGTAAATTTGCCCTTAAACCACA
AG CCTTG CTAGATACT AATTGG CAG AAAGTAATTCTT cGTGATAAAGAAA 
CACAAGTTCGATTTACTATTGATGCTAGTCAATTTAGTCAGAAATTAAAA 
GAACAGATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACGTTTTAAAGA 
AG C CAAGGATAGTAATCAG G AGTTAATGAGTATT C CTTTTGTAGGATTT A 
ATGGTGATTTTGCGAACTTACAAGCACCTGAAACACCGATTTATAAGACG 
CTTT CTAAAGGTAGT TT CTACTAT AAACCAAATGATACAACTCATAAAG A 
CCAATTGGAGTACAATGAATCAGCTCCTTTTGAAAGCAACAACTATACTG 
C CTTGTT AACACAAT CAG CGT CTTGGGG CT ATGTTG ATT ATGTCAAAAAT 
GGTGGGGAGTTAGAATTAGC^CCGGAGAGTCCAAAAAGAATTATTTTAGG 
aACTTTTGAGAATAAGGTTGAGGATAAAACAATT CAT CTTTTGG AAAGAG 
ATG CAG CGAATAAT C CAT ATTTTG C CATTT CT C CAAATAAAG ATGGAAAT 
AGGG ACGAAAT CACT C CC CAGG CaACTTT CTTAAG AAATGTT AAG GATAT 
TT CTG CT CAAG t T CTAG ATCAAAATGG AAATGTT ATTTGG CAAAGTAAG G 
TTTT ACCAT CTTAT CGT AAAAATTT CCATAAT aAT CCAAAGCAAAGT GAT
GGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAgATAAGGATGG 
CAAAGTTGTAgCAGATCGtTTTTATACTTATCGCTTACGTTACACACCAG 
T AGCAG AAGGAG CAAATAGT CAGG AGT CAG ACTTT aAAGTTCAAGTAAGT 
AcTAAGT(^CCAAATCTTCCTTC^CGAGCTCAGTTTGATGaAACTAATCG 
AACATT AAG CTT AG C CATG C CTAAG GAAAGTAGTTATGTT CCTACATAT C 
GTTTAC^ATTAGTTTTATCTCATGTTGTAAAAGATGAAGAATATGGGGAT 
GAGACTT CTT ACCATTATTT CCATATAGAT CAAGAAGGTAAAGTGACACT

TCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCGGTAGACCCTAAGG 

C CTTGACACTTGTTGTGGAAG ATAAAG CTGGTAATTTTG CAACGGTAAAA 

TTGTCTG AC CTCTTGAATAAGGCAGTAGTATCAGAGAAAG AAAACG CTAT 

AGTAATTTCTAACAGTTTCAAATATTTTGATAACTTG 

TGTTTATTT CTAA^G AAGGAAAAGTAGTAAACAAG AATCTAGAAGAAATA 

AC^TTAGTTAAGCCTC^AACTAC^GTTACTACTCAATCATTGTCTAAAGA 

AATAACTAAAT CAGG AAATG AGAAAGTCCTCACT T CTACAAACAATAATA 

GTAG CAG AGT AG CT AAGAT CATAT CAC CTAAACAT AACG GGG ATTCTGTT 

AACCATACC 

SEQ ID NO. 4408 
STRAIN M781 

GAGGAGCAAGAATTAAAAAACCAAGAGCAATCACCTGT 
AATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATATTG 
TTG^AAAAACAT CTGTAACAGCTG CTTCTG CTAGTAA.TACAGTGAAAGAA 
ATGGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGA 
AGAGTTATCTAAAAACCTTGATACGTCTAATTTGGGGGCTGATCTTGAAG 
AAGAATATCCCTCTAAACCAGAGACAACCAACAATAAAGAAAGCAATGTA 
GTAACAAATG CTTCAACTG CAATAG CACAG AAAGTTC C CT CAG CATATG A 
AGAGGTGAAGTCAG AAAG CAA.GTCATCGCTTG CTGTT CI^GATACAT CTA 
AAATAACAAAATTACAAGCCA(^^CCCAAAGAGGAAAGGGAAATGTAGTA 
G CTATT ATTGATACTGG CTTTGATATTAACCATGATATTTTT CGTTTAGA 
TAG CC CAAAAGATG ATAAG CACAG CTTTAAAACTAAGG CAGAATTTG AG G 
AATTAAAAG CAAAACATAATATCACTTATGGGAAATGGGTTAACG ATAAG 
ATTGTTTTTGCACATAACTACGCCAaCAATACAGAAACGGTGGCTGATAT 
TG CAG CAG CT ATGAAAG ATGGTTATGGGTCAGAAG CAAAG AATATTTTG C 
ATGGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCA 
ATCAATAGTCTTCTTTTAGAAGGTGCAGCGCCAAATGCTCAAGTCT^ 
AATGCGTATTCCAGATAAAATTGATTCGGACAAATTTGGAGAAGCATATG 
CTAAAG CAAT CATAGACG CTGTTAATCTAGGAGCAAAAACGATTAATATG 
AG CCTGGGAAAAACGGCTG ATTCTTTAATTG CTCT CAATGATAAAGTTAA 
ATTAGCACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTGTGGCTG 
C CGG AAATGAAG GTG CATTTGGTAT GGATTATAG CAA a C CATTATCAa CT 
AATCCTGACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAGATACTTT 
GAGTGTTGCTAGCTATGAATCACTtAAAACTATCAGTGAGGTCGTTGAAA 
CAACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACtTCTAaACCT 
TTTGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAA 
AAAGATTTTGAAGGTAAGG A CTTTAAAGGTAAGATTG CATTAATTGAG CG 
TGGTGGTGGACTTGATTTTATGACTAAAATCACTCATGCTACAAATGCAG
GTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGGAAATTTT 
cTAATTCCTTACCGTGAATTACCTGTGgGGGTTATTAGTAAAGTAGATGG 
CG AG CGTAT AAAAAATACTT CAAGT CAGTTAACATTTAAC CAGAGTTTTg 
AAGTAGTTG ATAG CCAAGGTGG CAATCGTATGCTGGAACAATCAAGTTGG 
GGCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTT 
TGAAATTTATTCTTCAACCTATAATAATCAATACTAAACAATGTCTGGTA
CAAGTAT GG CTTCAC CACATGTTG CAG GATTAATG AC AATG CTT CAAAGT 
CATTTGG CTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAATTGCT 
AGAATTGTCTAAAAACATCCTCATGAGCTCAGCAACAGC^TTATATAGTG 
AAGAGGATAAGGCX3TTTTATT(ZACCACGTCAGCAAG^ 
GATG CTG AAAAAGCT ATCCAAG CT CAATATTATGTTACTGGAAACGATGG 
CAAAGTTAAAATTAATCTCAAACGAGAGGGAGATAAATTTGATATCACAG
TTACAATTCATaaACTTGTAgAAGGTGTCAAAGAATTGTAT^ 
AATGTAGCaaCAGAACAAGTAAATAaAGGTAAATTTGCCCTTaAaCCaCA 
AG C CTTG CTAGATACTAATTGG CAG AaAGT aATT CTT cGTGATAAAGAAA 
CACAAGT Tc G ATTTACTA t TGATG OTAGTCAATTT AGTCAGAAATTAAAA 
GAACAGATGGCAAATGGTTATTTCTTAGAAGGTTTTGTACGTTTTA 
AG CCAAGGATAGTAATCAGGAGTTAATGAGTATTC CTTTTGTAGGATTT A 
ATGGTGATTTTGCGAACTtACAAGCACTTGAAACACCGATTTATAAGACG 
CTTT CTAAAGGTAGTTT CTACTATAAa C CAAATGATACAACT CAT AAAGA 
CCAATTGGAGTACAATGAATCAGCTCCTITTTGAAAGCAAC!AACT 
CCTTGTTAACACAATCAGCGTCTTGGGGCTATGTTGATTATGTCAAAAAT 
GGTGGGG AGTTAGAATT AG CAC CG GAGAGTCCAAAAAG AATT ATTTTAGG 
AACTTTT'GAGAATAAGGTTGAGGATAAAACAATT^TCTTTTGGAAAGAG 
ATG CAG CGAATAAT C CATATTTTG C CATTT CT CCAAATAAAG ATG GAAAT 
AGGG ACG a aATCACT CC C CAGG CaAC t TT CTT AAG AAATGTTAAG GATAT 
TT CTG CT CAAG t TCTAG ATCAAAATGG AAATGTT ATTTGG CAAAGTAAG G 
TTTTAC CAT CTTATCGTAAAAATTT CCATAAT aATC CAAAG CAAAGTGAT 
GGTCATTATCGTATGGATGCTCTTCAGTGGAGTGGTTTAGATAAGGATGG 
CAAAGTTGTAGCAGATGGTTTTTATACTTATCGCT 

TAGC^GAAGGAGCAAATAGTC^GGAGTCAGACriT'AAAGTTCAAGTAAGT 
ACTAAGT CAC CAAAT CTTC CTT CAC GAG CTCAGTTTG ATGAAACTAATCG 
AACATTAAGCTTAGCCATGCCTAAGGAAAGTAGTTATGTTCCTACAtATC 
GTTT ACAATT AGTTTTAT CT CATGTTGTAAAAGATGAAG AAT ATGGGGAT 
GAGACTT CTT AC CATTATTT CCATATAGAT CAAGAAGGTAAAGTGACACT 
T C CT AAAACGGTTAAGATAG G AGAG AGTG AGGTTG CGGTAG ACC CTAAGG 
CCTTGACACTTGTTGTGGAAGATAAAG CTGGTAATTTTG CAACGGTAAAA 
TTGTCTG AC CTCTT G AATAAGG CAGT AGTATCAG AGAAAGAAAACG CTAT 
AGTAATTTCTAA CAGTTTCAAATAT TTTGATAACT TG AAG AAAG AAC CTA 
TGTTTATTT CTAAAGAAGGAAAA.GTAGTAAACAAGAATCTAGAAGAAATA 
ACATTAGTTAAGCCTCAAACTACAGTTACrACTCAATCATTGTCTAAAGA 
AATAACTAAATCAGG AAATG AG AAAGTC CT CACT T CTACAAACAATAATA 
GTAG CAG AGT AG CT AAG ATCATATCAC CTAAACAT AACGG GG ATT CTGTT 
AACCATACC

SEQ ID NO. 4409 
STRAIN CJB110 

GAGG AGCAAGAATTAAAAAACCAAG AG CAATCAC CTGTAA 

TTGCTAATGTTGCTCAACAGCCATCGCCATCGGTA^CTACTAATATTGTT 

GAAAAAACATCTGTAnCAGCTGCTTCTGCTAGTAATACAGCGAAAGAAAT 

GGGTG AT ACAT CTGTAA?WVATGACAAA^CAG AAGATGAATTATTAGAAG 

AGTTATCTAAAAACCTTGATACGTCrAATv/rGGGGGCTGATCTTGAAGAA 

GAATAT C CCTCTAAACCAGAGACAAC CAACAATAAAGAAAG CAATGTAGT 

AACAAATGCTTCAACTG CAATAG CACAGAAAGTT C CCTCAGCGTATG AAG 

AGGTGaAG C CAGAAAG CAAGTCATCG CTTG CTGTTTTTGATACATCTAAA 

ATAACAAAATTG CAAG C CATAACC CAAAGAGGAAAGGGAAATGTAGTAG C 

TATT ATTGATACTGG CTTTG ATATTAACCATGATATTTTT CGTTTAGAT A 

GCCCAAAAGATGATAAGCACAGCTTTAAAACTAAAGCAGAATTCGAGGAA 

tTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAGAT 

TGTTTTTGCACAlTAACTACG CCAACAATACAGAAACGGTG G CTGATATTG 

CAGCAGCTATGAAAGATGGTTATGGGTCAGAAGCAAAGAATATTTCGCAT 

GGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAAT

CAATGGT CTTCTTTTAG AAGGTG CAGCG C CAAATG CT CAAGT CTT ATTAA 

TGCGTATTCCAGATAAAATTGATTCGGACAAATTTGGAGAAGCATATGCT 

AAAGCAATCACAGACG CTGTTAAT CTAGG AGCAAAAACGATTAATATGAG 

CCTTGGAAAAACAGCAGATTCTTTAATTGCACTCAATGATAAAGTTAAAT 

TAgCACITAAATTAGCTTcTGAGAAGGGCGTTGCAGTTGTTGTGGCTGCC 

GGAAATGAAGGTGCATTTGGTATGGATTATAgCAAACCATTATCAACTAA 

Tc CTGACTACGG t ACGGTTAATAGT CCAG CTATTT cTGAAGATACTTTG A 

GTGTTG CTAG CTATGAAT CACTTAAAACTATCAGTGAGGTCGTTG aAACA 

ACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACTTcTAAACCTTT 

TGACAAAGGTAAGGCCTACGATGTGGTTTATGCCAATTATGGTGCAAAAA 

AAGACTTTGAAGGTAAGGAltituAAAGGTAAGATTGCATTAATTGAGCGT 

GGTGGTGGACTTGATTTTATGACT AAAATCACTCATGCTACAAATG CAGG 

TGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAA^CGTGGAAATTTTc 

TAATTCCTTACCGTGAATTACCTGTGgGGGTTATTAGTAAAGTAGATGGC 

GAGCGTATAAAAAATACTTCAAGTCAGTTAACATTTAACCAgAGTTTTGA 

AGTAgTTGATAGCCAAgGTGGCAATCGTATGCTGGAACAATCAAGTtGGG 

GCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTGGCTTT 

GAAATTTATT CTTCAAC CTATAATAAT CAATACCAAACAATGTCTGGTAC 

AAGTATGGCrTCACCAGATGtTGCAGGATTAATGACAATGCTTCAAAATC 

ATTTGGCTGAGAAATATAAAGGGATGAATTTAGATTCTAAAAAATTGCTA 

GAATTGTCTAAAAACATCCT CATG AGCT CAG CAACAG CAT TAT AT AGTG A 

AG AGGAT AAGGCGTTTTATTCAC CACGTCAGCAAGG t G CAGGTGT AGTTG 

ATGCrGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAAACGATGGC 

AAAG CTAAAATTAAT CT CAAACGAGTG GGAGATAAATTTG AT AT CACAGT 

TACAATTCATAAACTTGTAGAAGGTGTCAAAGAATTGTATTATCAAGCTA

ATGTAGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCrTaAACCACAA 

GCCTTG CTAGATACTAATTGG CAG AAAGTAATTCTT C GTGATAAAGAAAC 

ACAAGTTCGATTTACTAtTGATGCTAGTCAATTTAgTCAGAAATTAAAAG 

AACAGATGGCAAATGGTTATTTCTTAgAAGGTTTTGTACGTTTTAAAGAA 

G C CAAGGAT AGT AAT CAGG AGTTAATG AG T ATTC CTTTTG T AGG ATTT AA 

TGGTGATTTTGCGAACTtACAAGCACTTGAAACACCGATTTATAAGACGC 

TTTCTAAAGGTAGT t T CTACTATAAAC CAAATGATACAAC TCATAAAGAC 

CAATTGGAGTACAATGAATCAGCTCc t TTTGAAAG CAACAACTATACTGC 

CTTGTTAA(CACAATCAGCGTCTTGGGGCTATGTTGATTATGTCAAA 

GTGGGGAGTTAGAATTAG CACCGGAGAGTCCAAAAAG AATTATTTTAGGA 

ACITTTGAGAATAAGGTTGAGGATAAAACAAT^ 

TG CAGCG AATAATCCATATTTTGCCATTTCTCCAAATAAAGATGGAAATA 
GGGATGaaAT CACTCC CCAGGCAAC t TT CTTAAGAAATGTTAAGGATATT 
TCTGCTCAAGTTCTAGATCAAAATGGAAATGTTATTTGGCAAAGTAAGGT 
TTTACCATCT T ATCGTAAAAATTTC CATAATAATCCAAAG CAAAGTGATG 
GTCATT ATCGT ATGG ATG C CTTT CAGTGG AGTGGTTTAg ATAAgGATGG C 
AAAGTTGTAGCAGATGGTTTTTATACTTATCGCCTACGTTAC^CACCAGT 
AG CAG AAgGAGCAAATAGT CAGGAGT CAg ACTTTAAAGTTCAAGTAAGTA 
CTAAGT CAC CAAAT CTT CCTTT ACTAG CT CAGTTTGATG AAACTAAT CG A 
ACATTAAGCTT AG C CATG CCTA^GG AAAGTAGTT ATGTTC CT ACATATCG 
TTTACAATTAGTTTTATCrCATGTTGTAAAAGATGAAGAATATGGGGATG 
AGACTTCTTACCATTATTTCCATATAGATCAAGAAGGTAAAGTGACACTT 
CCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCAGTAGACCCTAAGGC 
CTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTTGCAACGGTaAAAT 
TGTCTGACCTCTTGAaTAAgGCAGTAGTATCAGAGAAAGAAAACGCTATA 
GTAATTTCTAAC^GITTCAAATATTITGATAACTTGAAAAAAGAATCTAT 
GTTTATTT CT AAAGAAGGAAAAGT AGTAAACAAG AAT CT AGAAG AAATAA 
(^TTAGTTAAGCCGCAaACTACAGTTACTACTCAATCATTGTCTAAAGAA 
ATAACTAAAT CAGG AAATG AGAAAGTCCT CACTT CTACAAACAATAATAG 
TAG CAGAGT AG CTAAGAT CATATCACCTAAACATAACGGG GATT CTGTTA 
ACCATACC 

SEQ ID NO. 4410 
STRAIN 1169NT

GAGG AG CAAG AATTAAAAAAC CAAG AG CAATC 

ACCTGTAATTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTA 
ATATTGTTGAAAAAACATCTGTAACAGCTGCTTCTGCTAGTAATACAGCG 
AAAG AAATG G GTGATACAT CTGTAAAAAATG ACAAAACAG AAGATG AATT 
ATTAG AAG AGTTATCTAAAAAC CTTGATACGT CTAATATGGGGG CTG AT C 
TTGAAG AAGAAT ATC C CT CT AAAC CAGAG ACAAC CAACAATAAGG AAAG C
AATGTAGTAACAAATGCTT CAACTG GAAT AG CACAGAAAGTT C CCTCAGC 
ATATGAAGAGGTGAAGCCAAAAAGCAAGTCATCGCTTGCTGTTCTTGATA 
CATCT AAAATAACAAAATTG CAAG CCATAAC C CAAAG AGGAAAGG GAAAT 
GTAGTAGCrATTATTGATACTGGCTTTGATATTAACCATGATATTTTTCG 
TTTAGAT AG CCCAAAAG ATG ATAAG CACAG CTTTAAAAAT AAGG CAGAAT 
T CGAGGAATTAAAAG CAAAA.CATAATATCACTTATGG GAAAT GG GTT AAC 
GATAAGATTGTTTTTGCACATAACTACGCCAACAATACAGAAACGGTGGC 
TGATATTG CAG CAG CTATGAAAgATGGTTATGGTT CAGAAG CAAAGAATA 
TTTCG CATGGT ACACACGTTG CTGGTATT t TTGTAGGTAATAGT AAACGT 
CCAGCAATCAATGGTCTTCTTTTAgAAGGTGCAgCGCCAAATGCTCAAGT 
CTTATTAATG CGTAT TC CAG ATAAAAT t G ATT CG G ACAAATT t GG AG AAG 
CATATG CT AAAG CAATCACAGACG CTGTT AATCTAGG AG CTaAAACG ATT 
AATATGAGTATTGG AAAAACAG CTG ATTCTTTAATTGCT CTCAATGATAA 
AGTTAAATTAgCACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTG 
TGGCTGcCGGAAATGAAGGCGCATTtGGTATGGATTATAGCAAACCGTTA 
TCAACTAAT c CTGACTACG G t ACGG t T AATAGT C CAG CT ATTTCTGAAGA 
TACTTTG AGTGTTG CTAGCTATGAATCACTTAAAACTAT CAGTG AGGT CG 
TTGAAACAACTATT G AAGGTAAGTTAGTTAAGT t G C CGATTG t G ACTT CT 
AAAC CTT 1 1 GACAAAGGTAAGG CCTACGATGTGGTTTATG CCAATTATGG 
TG CAAAAAAAGACTTTG AAGGTAAGGACTTTAAAG GTAAGATTG CAT TAA 
TTGAGCGTGK3TGGTGGACTTGATTTTATGACTAA 

AATGCAGGTGTTGTTGGTATCGTTATTTTTAACGATCAAGAAAAACGTGG 
AAATTTT CTAATTCCTT AC C GTGAATTAC CTGTGGGGGTTATTAGTAAAG 
TAGATGG CGAG CGTAT AAAAaATACTTCAAGT CAGTTAACATTT AAC CAg 
AGATTTGAAGTAGTTGATAGCC^^gGTGGCAATCGTATGCTGGAACAATC 
aAGT t GGGG CGTGACAG CTG AAGG AGCAATCAAG CCTGATGTAACAG CTT 
CTGG CTT CG aAATTTATTCTT C a a CCT AT AATAATCAATACCAAACAATG 
TCTGGTACAAGTATGGCTTCACCACATGTTGCAGGATTAATGACAATGCT 
T CAAAGT CATTTGG CTG AG aAATATAAAGGGATGAATTTAgATT CTAaAA 
AATTG CTAG AATTGT CT AAAAACATCCT CATGAG CTCAG CAACAG CATTA 
TATAGTGAAGAGGAT AAGG CGTTTTATT CACCACGTCAG CAAGG t GCAGG 
TGTAG1TGATGCTGAAAAAGCTATCCAAGCTCAATATTATGTTACTGGAA 
ACGATGG CAA^G CTAAAATT AATCT CAAACGAGT GGG AG ATAAATTT GAT 
AT CACAGTTACAATT CATAAACTTGTAGAAGGTGTCAAAGAATTGTATTA 
TCAAG CT AATGT AG CAACAG AACAAGTAAATAAAGGTAAATTTG CCCTTA 
AACCACAAGCCTTGCTAGATACTAATTGGCAGAAAGTAATTCTTcGTGAT 
AAAG AAA CACAAGTT CGATTTACTATT GATG CTAGT CAATTTAgTCAGAA 
ATTAAAAGAACAGATGG CAAATGGTTATTTCITAg A^GGTTTTGTACGTT 
TT AAAGAAG CTAAGGAT AGT AAT CAGGAGTTAATGAGTATT C CTTTTGTA 
GGATTTAATGGTGATTTTGCGAGCTTACAAGCACTTGAAACACCGATTTA 
TAAGACG CTTT CTAAAGGT AGTTT CTACT ATAAAC CAAATGATACAACTC 
ATAAAGAC CAATTG G AGTATAATG AAT GAG CT CCTTTTG AAAG CAACAAC 
TATACTG C CTTGTTAACACAAT CAGCGT CTTGGG G CT ATGTTGATTATGT 
CaAAAATGGTGGGGAGTTAGAATTAGCACCGGAGAGTcCAAAAAGAATTA 
TTTTAGGAACTTTTGAGAATAAGGTTGAGGATAA 

GAAAGAG ATG CAG CG AATAATC CAT ATTTTGC CATTT CT CCAAATAAAG A 
TGGAAATAG G GATGAAATCACT CC C CAGG CAACTTT CTTAAG A^ATGTT A 
AGGATAT1TCTG CTCAAGTT CT AG ATCAAAATGG AA ATGTTATTTGG CAA 
AGTAAGGTTTTACCATCrTATCGTAAAAATTTCCATAATAAT C CAAAG CA 
GAGTGATGGTCATTATCGTATGGATGCCCTTCAGTGGAGTGGTTTAgATA 
AGGATGG CAAAG TTGT AG CAG ATG GTTTTTATACTTATCG CTTACGTTAC 
ACAC CAG TAG CAGAAGG AG CAAATAGTCAGGAGT CAGACTTT AAAGTTCA 
AGTAAGTACTAAGTCACCAAATCTTCCTTCACGAGCTCAGTTTGATGaAA 
CTAATCGAAC^TTAAGCTTAGCCATGCCTAAGGGAAGTAGTTATGTTCCT 
ATATATCGTCTACAATTAGTTTTAT CT CATGTTGTAAAAGATGAAGAATA 
TGGAG ATGAG ACIT CTTACT ATTATTT CCAT ATAGAT CAAGAAGGTAAAG 
CGACACTTCCTAAAACGGTTAAGATAGGAGAGAGTGAGGTTGCAGTAGAC 
CCTAAGGCCTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGCAaC 
GGTAAAATTGT CTGACCTCTTGAATAAGG CAGTAGTAT CAGAGAAAGAAA 
, ACGCTAT AGTAATTTCTAACAGTTT CAAATATTTTGATAACTTGAAAAAA 
' GAACCTATGTTTATTTCTAAAAAAGAAAAAGTAGTAAACAAGAATCTAGA 
AGAaATAATATTAGTTAAGCCGCAcACTA(^AGTTACTACTCAaTCATTGT 
CTAAAGAAATAACTAAATCAGGAAATGAGAAAGTCCTCACTTCTACAAAC 
AATAATAGTAGTAGAGTAGCTAAAATCATATCACCTAAACATAATGGGGA 
TT CTGTT AACCATACC 

SEQ ID NO. 4411: JM9130013 STRAIN

GAGGAGCAAGAATTAAAAAACCAAGAG CAATCACCTGTAA 

TTGCTAATGTTGCTCAACAGCCATCGCCATCGGTAACTACTAATACTGTT 

GAAAAAACATCTGTAACAGCTGCTTCTGCTAGTAATACAGCGAAAGAAAT 

GGGTGATACATCTGTAAAAAATGACAAAACAGAAGATGAATTATTAGAAG 

AGTTATCTAAAAACCrrTGATACGTCTAATTTGGGGGCrrGATCTTGAAGAA 

GAATATCCCTCTAAACCAGAGACAACCAACAATAAAGAAAGCAATGTAGT 

AACAAATGCTTCAACTG CAAT AG C ACAGAAAGTT CC CT CAG CAT ATG AAG 

AGGTGAAGCCAGAAAGCAAGTCATCGCTTGCTGTTCTTGATACATCTAAA 

ATAACAAAATTACAAGCCATAACCCAAAGAGGAAAGGGAAATGTAGTAGC 

TATTATTGATACTGGCTTTGATATTAACCATGATATTTTTCGTTTAGATA 

G C CCAAAAG ATG AT AAG CACAG CT TTAAAACT AAGACAG AATTTG AGGAA 

TTAAAAGCAAAACATAATATCACTTATGGGAAATGGGTTAACGATAAGAT 

TGTTTTT G CACATAACT ACG C CAACAAT ACAG AAACGGTGG CTG ATATTG 

CAGCAGCTATGAAAGATGGTTATGGTTCAGAAGCAAAGAATATTTCGCAT

GGTACACACGTTGCTGGTATTTTTGTAGGTAATAGTAAACGTCCAGCAAT

CAATGGT CTT CTTTTAG AAGGTG CAG CG CCAAATG CT CAAGT CTT ATTAA 

TG CGTATT C CAG ATAAAATTGATTCGG ACAAATTTGGTGAAG CAT ATGCT 

AAAG CAAT CACAGACGCTGTTAATCTAGG AG CAPAAACGATTAATATGAG 

TATTGGAAAAACAG CTG ATT CTTTAATTG CT CT CAATGATAAAGTTAAAT 

TAGC7ACTTAAATTAGCTTCTGAGAAGGGCGTTGCAGTTGTTC 

GG AAATG AAGG CGCATTTG GTATGG ATTAT AG CAAACCATTATCAACTAA 

TCCTGACTACGGTACGGTTAATAGTCCAGCTATTTCTGAAGATAirTTTGA 

GTGTTGCTAGCTATGAATCACTTAAAACTATCAGTGAGGTCGTTGAAACA 

ACTATTGAAGGTAAGTTAGTTAAGTTGCCGATTGTGACrTCTAAACCTTT 

TGACAAAgGTAAgGCCTACGATGTGGTTTATGCCAATTATGGTGCAAAAA 

AAGACITTGAAGGT AAGGACTTTAAAGGT AAG ATTG CATT AATTG AG CGT 

GGTGGTGGACTTGATTTTATGACTAAAATCACTCATG CTACAAATGCAGG 

TGTTGTTGGTATCGTTATTTTTAACGA.TCAAGAAAAACGTGGAAATTTTC 

TAATTCCTTACCGTGAATTACCTGTGGGGATTATTAGTAA^GTAGATGGC 

GAG CGTATAAAAA^TACITCAAGTCAGTTAACATTTAAC CAGAGTTTTG A 

AGTAGTTGAT AG C CAAGGTGGTAAT CGTATGCTG G AACAATCAAGTTGGG 

GCGTGACAGCTGAAGGAGCAATCAAGCCTGATGTAACAGCTTCTG^ 

GAAATTTATTCTTCAACCTATAATAATCAATACCAAACAATGTCTGGTAC 

AAGTATGGCTTCACCACATGTTGCAGGATTAATGA(2AATGCT 

ATTTGGCTGAGAAATATAAAGGGaTGAATTTAGATTCTAAAAAATTGCTA 

GAATTGTCTAAAAACATC CT CATGAG CTCAGCAACAG CATTATAT AGTGA 

AGAGGATAAGGCGTTTTATTCACCACGTCAGCAAGGTGCAGGTGTAGTTG 

ATG CTGAAAAAG CTATC CAAGCT C aATATTAT ATT ACTGGAAACGAT GG C 

AAAGCTAAAATTAATOTCAAACGAATGGGAGATAAATTTGATATCACAGT 

TACAATT CAT aAACTTGTAGAAGGTGTCAAAG AA t TGTATT ATCAAG CTA 

ATGTAGCAACAGAACAAGTAAATAAAGGTAAATTTGCCCTTaAACCACAA 

GCCTTGCTAGATACTAA.TTGGCAGAAAGTAATTCTTCGTGATAAAGAAAC 

ACAAGTTCGATTTACrATTGATGCTAGTCAATTTAGTCAGAAATTAAAAG 

AACAGATGG CAAATGGTTATTT CTTAG AAGGTTTTGTACGTTTTAAAGAA 

GC CAAGGATAGT AAT (^GG AGTTAATG AGT ATTCCTTTTGTAGG ATTTAA 

TGGTGATTTTGCGAACnTACAAGCACTTGAAACACt^ATTTATAAGACGC 

TTTCTAAAGGTAGTTTCTACTATAAACCAAATGATACAACTCATAAAGAC 

CAATTGGAGTACAATGAAT CAG CT C CTTTTGAAAG CAACAACTAT ACTG C 

CTTGTTAACACAAT CAG CGT CTTGG GGCTATGTTG ATTATGT CAAAAATG 

GTGGGGAGTTAGAATTAGCACCGGAGAGTCCAAAAAGAATTATTTTAGGA 

ACTTTTGAGAATAAGGTTGAGGATAAAACAATTCATCriT^ 

TGCAG CG AATAATC CAT ATTTTG C CATTT CTCCAAATAAAGATG GAAATA 

GGGACGAAATCACTCCCCAGGCAA.CTTTCTTAAGAAATGTTAAGGATATT 

TCTGCTCAAGTTCTAGATCAAAATGGAAATGTTATTTGGCAAAGTAAGGT 

TTTAC CATCTTATCGTAAAAATTTCCAT AATAAT C CAAAG CAAAGTGATG 

GT CATT ATCGTATGGATG CT CTTCAGTGGAGTGGTTTAGAT AAGG ATGG C 

AAAGTTGTAGCAGATGGTTTTTATACTTATCGCTTACGTTACACACCAGT 

AGCAGAAGGAGCAAATAGTCAGGAGTCAGACTTTAAAGTACAAGTAAGTA 

CTAAGTCACCAA^TCTTCCTTCACGAGCTCAGTTTGATGAAACTAATCGA 

ACATTAAGCTT AGC CATG CCTAAGG AAAGTAGTT ATGTT C CTACAT ATCG 

TTTACAATTAGTTTTATCTO\TGTTGTAAAAGATGAAGAATATGGGGATG 

AG ACTT CTTAC CATT ATTT C CATAT AG ATCAAGAAGGTAAAGTGACACTT 

CCTAAAACGGTTAAGATAGGAGAG AGTGAG GTTG CGG TAG ACCCT AAGG C 

CTTGACACTTGTTGTGGAAGATAAAGCTGGTAATTTCGCAaCGGTAAAAT 

TGTCTGATCTCTTGAATAAGGCAGTAGTATCAGAGAAAGAAAACGCTATA 

GTAATTT CT aACAGTTTCAAAT ATTTTGATAACTTGAZUVAAAGAAC CTAT 

GTTTATTTCTAAAAAAGAAAAAGTAGTAAACAAGAATCTAGAAGAAATAA 

TATTAGTTAAGCCG CAAACTACAGTTACTACTCAATCATTGTCTAAAGAA 

ATAACTAAAT CAGG AAA1X3AGAAAGTCCT CACTT CTACAAACAAT AATAG 

TAG CAGAGTAG CTAAGATCAT ATCACCTAAACAT AACGGGGATT CTGTT A 

ACCATACC 

PRETTY of: /biotmp/msal83564.2{*}  May 13, 2003 03:28

                            1                                                   50
msal83564.2{147_COH1}
msal83564.2{147_M732}
msal83564.2{147_M781}
msal83564.2{147_2603}       gtggataaac atcactcaaa aaaggctatt ttaaagttaa cacttataac
msal83564.2{147_JM9130013}
msal83564.2{147_18RS21}
msal83564.2{147_090}
msal83564.2{147_CJB110}
msal83564.2{147_A909}
msal83564.2{147_H36B}
msal83564.2{147_1169NT}     T
Consensus                   ********** ********** ********** ********** **********

                            51                                                 100
msal83564.2{147_COH1}                                                   GAGGAGCAAG
msal83564.2{147_M732}                                                   GAGGAGCAAG
msal83564.2{147_M781}                                                   GAGGAGCAAG
msal83564.2{147_2603}       aactagtatt ttattaatgc atagcaatca agtgaatgca GAGGAGCAAG
msal83564.2{147_JM9130013}                                              GAGGAGCAAG
msal83564.2{147_18RS21}                                                 GAGGAGCAAG
msal83564.2{147_090}                                                    GAGGAGCAAG
msal83564.2{147_CJB110}                                                 GAGGAGCAAG
msal83564.2{147_A909}                                                   GAGGAGCAAG
msal83564.2{147_H36B}                                                   GAGGAGCAAG
msal83564.2{147_1169NT}                                                 GAGGAGCAAG
Consensus                   ********** ********** ********** ********** **********



                            101                                                150
msal83564.2{147_COH1}       AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_M732}       AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_M781}       AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_2603}       AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_JM9130013}  AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_18RS21}     AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_090}        AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_CJB110}     AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_A909}       AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_H36B}       AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
msal83564.2{147_1169NT}     AATTAAAAAA CCAAGAGCAA TCACCTGTAA TTGCTAATGT TGCTCAACAG
Consensus                   ********** ********** ********** ********** **********



                            151                                                200
msal83564.2{147_COH1}       CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_M732}       CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_M781}       CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_2603}       CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_JM9130013}  CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_18RS21}     CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_090}        CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC
msal83564.2{147_CJB110}     CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAnCAgC
msal83564.2{147_A909}       CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAtC
msal83564.2{147_H36B}       CCATCGCCAT CGGTAACTAC TAATAcTGTT GAAAAAACAT CTGTAaCAtC
msal83564.2{147_1169NT}     CCATCGCCAT CGGTAACTAC TAATAtTGTT GAAAAAACAT CTGTAaCAgC
Consensus                   ********** ********** *****_**** ********** *****_**_*



                            201                                                250
msal83564.2{147_COH1}       TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_M732}       TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_M781}       TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_2603}       TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_JM9130013}  TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_18RS21}     TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_090}        TGCTTCTGCT AGTAATACAG tGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_CJB110}     TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_A909}       TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_H36B}       TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
msal83564.2{147_1169NT}     TGCTTCTGCT AGTAATACAG cGAAAGAAAT GGGTGATACA TCTGTAAAAA
Consensus                   ********** ********** _********* ********** **********



                            251                                                300
msal83564.2{147_COH1}       ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_M732}       ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_M781}       ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_2603}       ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_JM9130013}  ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_18RS21}     ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_090}        ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_CJB110}     ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_A909}       ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_H36B}       ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
msal83564.2{147_1169NT}     ATGACAAAAC AGAAGATGAA TTATTAGAAG AGTTATCTAA AAACCTTGAT
Consensus                   ********** ********** ********** ********** **********



                            301                                                350
msal83564.2{147_COH1}       ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_M732}       ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_M781}       ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_2603}       ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_JM9130013}  ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_18RS21}     ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_090}        ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_CJB110}     ACGTCTAATw TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_A909}       ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_H36B}       ACGTCTAATt TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
msal83564.2{147_1169NT}     ACGTCTAATa TGGGGGCTGA TCTTGAAGAA GAATATCCCT CTAAACCAGA
Consensus                   *********_ ********** ********** ********** **********



                            351                                                400
msal83564.2{147_COH1}       GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_M732}       GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_M781}       GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_2603}       GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_JM9130013}  GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_18RS21}     GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_090}        GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA






WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



msal83564.2{147_CJB110}     GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_A909}       GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_H36B}       GACAACCAAC AATAAaGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
msal83564.2{147_1169NT}     GACAACCAAC AATAAgGAAA GCAATGTAGT AACAAATGCT TCAACTGCAA
Consensus                   ********** *****_**** ********** ********** **********



                            401                                                450
msal83564.2{147_COH1}       TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGtC AgAAAGCAAG
msal83564.2{147_M732}       TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGtC AgAAAGCAAG
msal83564.2{147_M781}       TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGtC AgAAAGCAAG
msal83564.2{147_2603}       TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_JM9130013}  TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_18RS21}     TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_090}        TAGCACAGAA AGTTCCCTCA GCgTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_CJB110}     TAGCACAGAA AGTTCCCTCA GCgTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_A909}       TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_H36B}       TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGcC AgAAAGCAAG
msal83564.2{147_1169NT}     TAGCACAGAA AGTTCCCTCA GCaTATGAAG AGGTGAAGcC AaAAAGCAAG
Consensus                   ********** ********** **_******* ********_* *_********



                            451                                                500
msal83564.2{147_COH1}       TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TaCAAGCCAc
msal83564.2{147_M732}       TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TaCAAGCCAc
msal83564.2{147_M781}       TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TaCAAGCCAc
msal83564.2{147_2603}       TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TaCAAGCCAt
msal83564.2{147_JM9130013}  TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TaCAAGCCAt
msal83564.2{147_18RS21}     TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TaCAAGCCAt
msal83564.2{147_090}        TCATCgCTTG CTGTTtTTGA TACATCTAAA ATAACAAAAT TgCAAGCCAt
msal83564.2{147_CJB110}     TCATCgCTTG CTGTTtTTGA TACATCTAAA ATAACAAAAT TgCAAGCCAt
msal83564.2{147_A909}       TCATCaCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TgCAAGCCAt
msal83564.2{147_H36B}       TCATCaCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TgCAAGCCAt
msal83564.2{147_1169NT}     TCATCgCTTG CTGTTcTTGA TACATCTAAA ATAACAAAAT TgCAAGCCAt
Consensus                   *****_**** *****_**** ********** ********** *_*******_



                            501                                                550
msal83564.2{147_COH1}       AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_M732}       AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_M781}       AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_2603}       AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_JM9130013}  AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_18RS21}     AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_090}        AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_CJB110}     AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_A909}       AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_H36B}       AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
msal83564.2{147_1169NT}     AACCCAAAGA GGAAAGGGAA ATGTAGTAGC TATTATTGAT ACTGGCTTTG
Consensus                   ********** ********** ********** ********** **********



                            551                                                600
msal83564.2{147_COH1}       ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_M732}       ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_M781}       ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_2603}       ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_JM9130013}  ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_18RS21}     ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_090}        ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_CJB110}     ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_A909}       ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_H36B}       ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
msal83564.2{147_1169NT}     ATATTAACCA TGATATTTTT CGTTTAGATA GCCCAAAAGA TGATAAGCAC
Consensus                   ********** ********** ********** ********** **********



                            601                                                650
msal83564.2{147_COH1}       AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_M732}       AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_M781}       AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_2603}       AGCTTTAAAA cTAAgaCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_JM9130013}  AGCTTTAAAA cTAAgaCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_18RS21}     AGCTTTAAAA cTAAgaCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_090}        AGCTTTAAAA cTAAagCAGA ATTcGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_CJB110}     AGCTTTAAAA cTAAagCAGA ATTcGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_A909}       AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_H36B}       AGCTTTAAAA cTAAggCAGA ATTtGAGGAA TTAAAAGCAA AACATAATAT
msal83564.2{147_1169NT}     AGCTTTAAAA aTAAggCAGA ATTcGAGGAA TTAAAAGCAA AACATAATAT
Consensus                   ********** _***__**** ***_****** ********** **********



                            651                                                700
msal83564.2{147_COH1}       CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_M732}       CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_M781}       CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_2603}       CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_JM9130013}  CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_18RS21}     CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG






msal83564.2{147_090}        CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_CJB110}     CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_A909}       CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_H36B}       CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
msal83564.2{147_1169NT}     CACTTATGGG AAATGGGTTA ACGATAAGAT TGTTTTTGCA CATAACTACG
Consensus                   ********** ********** ********** ********** **********



                            701                                                750
msal83564.2{147_COH1}       CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_M732}       CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_M781}       CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_2603}       CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_JM9130013}  CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_18RS21}     CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_090}        CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_CJB110}     CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_A909}       CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_H36B}       CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
msal83564.2{147_1169NT}     CCAACAATAC AGAAACGGTG GCTGATATTG CAGCAGCTAT GAAAGATGGT
Consensus                   ********** ********** ********** ********** **********

                            751                                                800
msal83564.2{147_COH1}       TATGGgTCAG AAGCAAAGAA TATTTtGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_M732}       TATGGgTCAG AAGCAAAGAA TATTTtGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_M781}       TATGGgTCAG AAGCAAAGAA TATTTtGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_2603}       TATGGtTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_JM9130013}  TATGGtTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_18RS21}     TATGGtTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_090}        TATGGgTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_CJB110}     TATGGgTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_A909}       TATGGgTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_H36B}       TATGGgTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
msal83564.2{147_1169NT}     TATGGtTCAG AAGCAAAGAA TATTTcGCAT GGTACACACG TTGCTGGTAT
Consensus                   *****_**** ********** *****_**** ********** **********



                            801                                                850
msal83564.2{147_COH1}       TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATaGTCTT CTTTTAGAAG
msal83564.2{147_M732}       TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATaGTCTT CTTTTAGAAG
msal83564.2{147_M781}       TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATaGTCTT CTTTTAGAAG
msal83564.2{147_2603}       TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_JM9130013}  TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_18RS21}     TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_090}        TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_CJB110}     TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_A909}       TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_H36B}       TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
msal83564.2{147_1169NT}     TTTTGTAGGT AATAGTAAAC GTCCAGCAAT CAATgGTCTT CTTTTAGAAG
Consensus                   ********** ********** ********** ****_***** **********



                            851                                                900
msal83564.2{147_COH1}       GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_M732}       GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_M781}       GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_2603}       GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_JM9130013}  GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_18RS21}     GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_090}        GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_CJB110}     GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_A909}       GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_H36B}       GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
msal83564.2{147_1169NT}     GTGCAGCGCC AAATGCTCAA GTCTTATTAA TGCGTATTCC AGATAAAATT
Consensus                   ********** ********** ********** ********** **********



                            901                                                950
msal83564.2{147_COH1}       GATTCGGACA AATTTGGaGA AGCATATGCT AAAGCAATCA tAGACGCTGT
msal83564.2{147_M732}       GATTCGGACA AATTTGGaGA AGCATATGCT AAAGCAATCA tAGACGCTGT
msal83564.2{147_M781}       GATTCGGACA AATTTGGaGA AGCATATGCT AAAGCAATCA tAGACGCTGT
msal83564.2{147_2603}       GATTCGGACA AATTTGGtGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_JM9130013}  GATTCGGACA AATTTGGtGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_18RS21}     GATTCGGACA AATTTGGtGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_090}        GATTCGGACA AATTTGGaGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_CJB110}     GATTCGGACA AATTTGGaGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_A909}       GATTCGGACA AATTTGGtGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_H36B}       GATTCGGACA AATTTGGtGA AGCATATGCT AAAGCAATCA cAGACGCTGT
msal83564.2{147_1169NT}     GATTCGGACA AATTTGGaGA AGCATATGCT AAAGCAATCA cAGACGCTGT
Consensus                   ********** *******_** ********** ********** _*********



                            951                                               1000
msal83564.2{147_COH1}       TAATCTAGGA GCaAAAACGA TTAATATGAG ccTgGGAAAA ACgGCtGATT
msal83564.2{147_M732}       TAATCTAGGA GCaAAAACGA TTAATATGAG ccTgGGAAAA ACgGCtGATT
msal83564.2{147_M781}       TAATCTAGGA GCaAAAACGA TTAATATGAG ccTgGGAAAA ACgGCtGATT
msal83564.2{147_2603}       TAATCTAGGA GCaAAAACGA TTAATATGAG taTtGGAAAA ACaGCtGATT
msal83564.2{147_JM9130013}  TAATCTAGGA GCaAAAACGA TTAATATGAG taTtGGAAAA ACaGCtGATT






msal83564.2{147_18RS21}     TAATCTAGGA GCaAAAACGA TTAATATGAG taTtGGAAAA ACaGCtGATT
msal83564.2{147_090}        TAATCTAGGA GCaAAAACGA TTAATATGAG ccTtGGAAAA ACaGCaGATT
msal83564.2{147_CJB110}     TAATCTAGGA GCaAAAACGA TTAATATGAG ccTtGGAAAA ACaGCaGATT
msal83564.2{147_A909}       TAATCTAGGA GCaAAAACGA TTAATATGAG ccTtGGAAAA ACaGCaGATT
msal83564.2{147_H36B}       TAATCTAGGA GCaAAAACGA TTAATATGAG ccTtGGAAAA ACaGCaGATT
msal83564.2{147_1169NT}     TAATCTAGGA GCtAAAACGA TTAATATGAG taTtGGAAAA ACaGCtGATT
Consensus                   ********** **_******* ********** __*_****** **_**_****



                            1001                                              1050
msal83564.2{147_COH1}       CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_M732}       CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_M781}       CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_2603}       CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_JM9130013}  CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_18RS21}     CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_090}        CTTTAATTGC aCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_CJB110}     CTTTAATTGC aCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_A909}       CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_H36B}       CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
msal83564.2{147_1169NT}     CTTTAATTGC tCTCAATGAT AAAGTTAAAT TAGCACTTAA ATTAGCTTCT
Consensus                   ********** _********* ********** ********** **********



                            1051                                              1100
msal83564.2{147_COH1}       GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_M732}       GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_M781}       GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_2603}       GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GcGCATTTGG
msal83564.2{147_JM9130013}  GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GcGCATTTGG
msal83564.2{147_18RS21}     GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GcGCATTTGG
msal83564.2{147_090}        GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_CJB110}     GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_A909}       GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_H36B}       GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GtGCATTTGG
msal83564.2{147_1169NT}     GAGAAGGGCG TTGCAGTTGT TGTGGCTGCC GGAAATGAAG GcGCATTTGG
Consensus                   ********** ********** ********** ********** *_********



                            1101                                              1150
msal83564.2{147_COH1}       TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_M732}       TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_M781}       TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_2603}       TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_JM9130013}  TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_18RS21}     TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_090}        TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_CJB110}     TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_A909}       TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_H36B}       TATGGATTAT AGCAAACCaT TATCAACTAA TCCTGACTAC GGTACGGTTA
msal83564.2{147_1169NT}     TATGGATTAT AGCAAACCgT TATCAACTAA TCCTGACTAC GGTACGGTTA
Consensus                   ********** ********_* ********** ********** **********



                            1151                                              1200
msal83564.2{147_COH1}       ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_M732}       ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_M781}       ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_2603}       ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_JM9130013}  ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_18RS21}     ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_090}        ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_CJB110}     ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_A909}       ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_H36B}       ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
msal83564.2{147_1169NT}     ATAGTCCAGC TATTTCTGAA GATACTTTGA GTGTTGCTAG CTATGAATCA
Consensus                   ********** ********** ********** ********** **********



                            1201                                              1250
msal83564.2{147_COH1}       CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_M732}       CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_M781}       CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_2603}       CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_JM9130013}  CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_18RS21}     CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_090}        CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_CJB110}     CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_A909}       CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_H36B}       CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
msal83564.2{147_1169NT}     CTTAAAACTA TCAGTGAGGT CGTTGAAACA ACTATTGAAG GTAAGTTAGT
Consensus                   ********** ********** ********** ********** **********



                            1251                                              1300
msal83564.2{147_COH1}       TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_M732}       TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_M781}       TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_2603}       TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG






msal83564.2{147_JM9130013}  TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_18RS21}     TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_090}        TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_CJB110}     TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_A909}       TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_H36B}       TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
msal83564.2{147_1169NT}     TAAGTTGCCG ATTGTGACTT CTAAACCTTT TGACAAAGGT AAGGCCTACG
Consensus                   ********** ********** ********** ********** **********



                            1301                                              1350
msal83564.2{147_COH1}       ATGTGGTTTA TGCCAATTAT GGTGC..AAA AAAGAtTTTG AAGGTAAGGA
msal83564.2{147_M732}       ATGTGGTTTA TGCCAATTAT GGTGC..AAA AAAGAtTTTG AAGGTAAGGA
msal83564.2{147_M781}       ATGTGGTTTA TGCCAATTAT GGTGC..AAA AAAGAtTTTG AAGGTAAGGA
msal83564.2{147_2603}       ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_JM9130013}  ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_18RS21}     ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_090}        ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_CJB110}     ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_A909}       ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_H36B}       ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
msal83564.2{147_1169NT}     ATGTGGTTTA TGCCAATTAT GGTGC.aAAA AAAGAcTTTG AAGGTAAGGA
Consensus                   ********** ********** *****__*** *****_**** **********



                            1351                                              1400
msal83564.2{147_COH1}       CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_M732}       CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_M781}       CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_2603}       CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_JM9130013}  CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_18RS21}     CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_090}        CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_CJB110}     CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_A909}       CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_H36B}       CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
msal83564.2{147_1169NT}     CTTTAAAGGT AAGATTGCAT TAATTGAGCG TGGTGGTGGA CTTGATTTTA
Consensus                   ********** ********** ********** ********** **********



                            1401                                              1450
msal83564.2{147_COH1}       TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_M732}       TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_M781}       TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_2603}       TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_JM9130013}  TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_18RS21}     TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_090}        TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_CJB110}     TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_A909}       TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_H36B}       TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
msal83564.2{147_1169NT}     TGACTAAAAT CACTCATGCT ACAAATGCAG GTGTTGTTGG TATCGTTATT
Consensus                   ********** ********** ********** ********** **********



                            1451                                              1500
msal83564.2{147_COH1}       TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_M732}       TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_M781}       TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_2603}       TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_JM9130013}  TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_18RS21}     TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_090}        TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_CJB110}     TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_A909}       TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_H36B}       TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
msal83564.2{147_1169NT}     TTTAACGATC AAGAAAAACG TGGAAATTTT CTAATTCCTT ACCGTGAATT
Consensus                   ********** ********** ********** ********** **********



                            1501                                              1550
msal83564.2{147_COH1}       ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_M732}       ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_M781}       ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_2603}       ACCTGTGGGG aTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_JM9130013}  ACCTGTGGGG aTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_18RS21}     ACCTGTGGGG aTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_090}        ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_CJB110}     ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_A909}       ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_H36B}       ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
msal83564.2{147_1169NT}     ACCTGTGGGG gTTATTAGTA AAGTAGATGG CGAGCGTATA AAAAATACTT
Consensus                   ********** _********* ********** ********** **********



                            1551                                              1600
msal83564.2{147_COH1}       CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_M732}       CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_M781}       CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_2603}       CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_JM9130013}  CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_18RS21}     CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_090}        CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_CJB110}     CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_A909}       CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_H36B}       CAAGTCAGTT AACATTTAAC CAGAGtTTTG AAGTAGTTGA TAGCCAAGGT
msal83564.2{147_1169NT}     CAAGTCAGTT AACATTTAAC CAGAGaTTTG AAGTAGTTGA TAGCCAAGGT
Consensus                   ********** ********** *****_**** ********** **********



                            1601                                              1650
msal83564.2{147_COH1}       GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_M732}       GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_M781}       GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_2603}       GGtAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_JM9130013}  GGtAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_18RS21}     GGtAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_090}        GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_CJB110}     GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_A909}       GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_H36B}       GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
msal83564.2{147_1169NT}     GGcAATCGTA TGCTGGAACA ATCAAGTTGG GGCGTGACAG CTGAAGGAGC
Consensus                   **_******* ********** ********** ********** **********



                            1651                                              1700
msal83564.2{147_COH1}       AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_M732}       AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_M781}       AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_2603}       AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_JM9130013}  AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_18RS21}     AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_090}        AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_CJB110}     AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_A909}       AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_H36B}       AATCAAGCCT GATGTAACAG CTTCTGGCTT tGAAATTTAT TCTTCAACCT
msal83564.2{147_1169NT}     AATCAAGCCT GATGTAACAG CTTCTGGCTT cGAAATTTAT TCTTCAACCT
Consensus                   ********** ********** ********** _********* **********



                            1701                                              1750
msal83564.2{147_COH1}       ATAATAATCA ATACtAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_M732}       ATAATAATCA ATACtAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_M781}       ATAATAATCA ATACtAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_2603}       ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_JM9130013}  ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_18RS21}     ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_090}        ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_CJB110}     ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_A909}       ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_H36B}       ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
msal83564.2{147_1169NT}     ATAATAATCA ATACcAAACA ATGTCTGGTA CAAGTATGGC TTCACCACAT
Consensus                   ********** ****_***** ********** ********** **********



                            1751                                              1800
msal83564.2{147_COH1}       GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_M732}       GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_M781}       GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_2603}       GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_JM9130013}  GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_18RS21}     GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_090}        GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_CJB110}     GTTGCAGGAT TAATGACAAT GCTTCAAAaT CATTTGGCTG AGAAATATAA
msal83564.2{147_A909}       GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_H36B}       GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
msal83564.2{147_1169NT}     GTTGCAGGAT TAATGACAAT GCTTCAAAgT CATTTGGCTG AGAAATATAA
Consensus                   ********** ********** ********_* ********** **********



                            1801                                              1850
msal83564.2{147_COH1}       AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_M732}       AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_M781}       AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_2603}       AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_JM9130013}  AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_18RS21}     AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_090}        AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_CJB110}     AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_A909}       AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_H36B}       AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
msal83564.2{147_1169NT}     AGGGATGAAT TTAGATTCTA AAAAATTGCT AGAATTGTCT AAAAACATCC
Consensus                   ********** ********** ********** ********** **********



                            1851                                              1900
msal83564.2{147_COH1}       TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_M732}       TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_M781}       TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_2603}       TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_JM9130013}  TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_18RS21}     TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_090}        TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_CJB110}     TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_A909}       TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_H36B}       TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
msal83564.2{147_1169NT}     TCATGAGCTC AGCAACAGCA TTATATAGTG AAGAGGATAA GGCGTTTTAT
Consensus                   ********** ********** ********** ********** **********



                            1901                                              1950
msal83564.2{147_COH1}       TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_M732}       TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_M781}       TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_2603}       TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_JM9130013}  TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_18RS21}     TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_090}        TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_CJB110}     TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_A909}       TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_H36B}       TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
msal83564.2{147_1169NT}     TCACCACGTC AGCAAGGTGC AGGTGTAGTT GATGCTGAAA AAGCTATCCA
Consensus                   ********** ********** ********** ********** **********



                            1951                                              2000
msal83564.2{147_COH1}       AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGtTAAA ATTAATCTCA
msal83564.2{147_M732}       AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGtTAAA ATTAATCTCA
msal83564.2{147_M781}       AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGtTAAA ATTAATCTCA
msal83564.2{147_2603}       AGCTCAATAT TATaTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_JM9130013}  AGCTCAATAT TATaTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_18RS21}     AGCTCAATAT TATaTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_090}        AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_CJB110}     AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_A909}       AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_H36B}       AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
msal83564.2{147_1169NT}     AGCTCAATAT TATgTTACTG GAAACGATGG CAAAGcTAAA ATTAATCTCA
Consensus                   ********** ***_****** ********** *****_**** **********



                            2001                                              2050
msal83564.2{147_COH1}       AACGAgaGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_M732}       AACGAgaGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_M781}       AACGAgaGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_2603}       AACGAatGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_JM9130013}  AACGAatGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_18RS21}     AACGAatGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_090}        AACGAgtGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_CJB110}     AACGAgtGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_A909}       AACGAgtGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_H36B}       AACGAgtGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
msal83564.2{147_1169NT}     AACGAgtGGG AGATAAATTT GATATCACAG TTACAATTCA TAAACTTGTA
Consensus                   *****__*** ********** ********** ********** **********



                            2051                                              2100
msal83564.2{147_COH1}       GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_M732}       GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_M781}       GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_2603}       GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_JM9130013}  GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_18RS21}     GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_090}        GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_CJB110}     GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_A909}       GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_H36B}       GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
msal83564.2{147_1169NT}     GAAGGTGTCA AAGAATTGTA TTATCAAGCT AATGTAGCAA CAGAACAAGT
Consensus                   ********** ********** ********** ********** **********



                            2101                                              2150
msal83564.2{147_COH1}       AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_M732}       AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_M781}       AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_2603}       AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_JM9130013}  AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_18RS21}     AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_090}        AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_CJB110}     AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_A909}       AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_H36B}       AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
msal83564.2{147_1169NT}     AAATAAAGGT AAATTTGCCC TTAAACCACA AGCCTTGCTA GATACTAATT
Consensus                   ********** ********** ********** ********** **********



                            2151                                              2200
msal83564.2{147_COH1}       GGCAGAAAGT AATTCTTCGT GATAAAGAAA CACAAGTTCG ATTTACTATT






msal83564 

msal83564 . 

msal83564. 
msal83564.2{l47 
msal83564.2{ 
msal83564 
msal83564 .2{ 

msal83564 . 

msal83564 . 
msal83564 .2{ 



2{147_M732 
2{l47_M781 
2{147_2603 
_JM9130013 
147_18RS2l] 
.2{l47_090; 
147_CJB110 
2{l47_A909 
2{l47_H36B; 
147_1169NT; 
Consensus 



msal83564 . 

msal83564 . 

msal83564 . 

msal83564 . 
msal83564.2{l47 
msal83564 .2 { 
msal83564 
rnsal83564 .2{ 

msal83564 . 

msal83564 . 
msa!83564 .2{ 



2{l47_COHl; 
2{147_M732 
2{147_M781 
2{147_2603 
_JM9130013 
147_18RS21 
.2{l47_090 
147_CJB110 
2{147_A909; 
2{147_H36B; 
147_1169NT/ 
Consensus 



msal83564 .2{l47_COHl 
msal83564 . 2 { 147_M732 
msal83564 -2{147_M781 
msal83564 .2{l47_2603 
msal83564 .2{147_JM9130013 
msal83564 . 2 { 147_18RS21 
msal83564.2{l47_090 
msal83564 . 2 { 147_CJB110 } 
msal83564 . 2 { 147_A909 * 
msal83564 .2{l47_H36B 
msal83564 . 2 { 147„1169NT 

Consensus 



msal83564 .2{147_C0H1 
msal83564.2{l47_M732 
msal83564 .2{147_M781 
msal83564.2{l47_2603; 
msal83564 . 2 { 147_JM9130013 / 
msal83564 . 2 { 147_18RS21 
msal83564 .2 { 147_090 
msal83564 . 2 { 147_CJB110 } 
msal83564.2{l47_A909} 
msal83564 .2{147_H36BJ 
msa!83564 . 2 {l47_1169NT} 

Consensus 



msal83564 
msal83564 
msal83564 
msal83564 . 
msal83564.2{l47 

msal83564 .2{ 
msal83564 

msal83564 .2{ 
msal83564 . 
msal83564 . 

msal83564 .2{ 



2{147_C0H1 
2{147_M732 
2{147_M781 
2{147_2603 
_JM9130013 
147_18RS21 
,2{147_090} 
147_CJB110} 
2{147_A909} 
2{147_H36B} 
147_1169NT} 
Consensus 



GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 
GGCAGAAAGT 



AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 
AATTCTTCGT 



GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 
GATAAAGAAA 



********** ********** ********** 



2201 

GATgCTAGTC 

GATgCTAGTC 

GATgCTAGTC 

GATgCTAGTC 

GATgCTAGTC 

GATgCTAGTC 

GATgCTAGTC 

GATgCTAGTC 

GATtCTAGTC 

GATtCTAGTC 

GATgCTAGTC 
***„****** 



AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 

AATTTAGTCA 
********** 



2251 

TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
TTTCTTAGAA 
********** 

2301 

AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 
AGTTAATGAG 



GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 
GGTTTTGTAC 

GGTTTTGTAC 
********** 



GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 

GAAATTAAAA 
********** 



GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 

GTTTTAAAGA 
********** 



CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 

CACAAGTTCG 
********** 



GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 

GAACAGATGG 
********** 



AGCCAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 
AGCcAAGGAT 

AGC t AAGGAT 
***».****** ********** 



ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 

ATTTACTATT 
********** 

2250 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 

CAAATGGTTA 
********** 

2300 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 
AGTAATCAGG 



TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 
TATTCCTTTT 



GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 
GTAGGATTTA 



********** ********** ********** 



2350 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAaCTTA 
TGCGAgCTTA 
*****_**** 

2400 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 
GTAGTTTCTA 

********** ********** ********** ..********* ********** 



2351 

CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 
CAAGCACTTG 



AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 
AAACACCGAT 



TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 
TTATAAGACG 



ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
ATGGTGATTT 
********** 



cTTTCTAAAG 
CTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
aTTTCTAAAG 
CTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
cTTTCTAAAG 
CTTTCTAAAG 



                            2401                                              2450
msal83564.2{147_COH1}       CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_M732}       CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_M781}       CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_2603}       CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_JM9130013}  CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_18RS21}     CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_090}        CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_CJB110}     CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_A909}       CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_H36B}       CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAcAATGAAT
msal83564.2{147_1169NT}     CTATAAACCA AATGATACAA CTCATAAAGA CCAATTGGAG TAtAATGAAT
Consensus                   ********** ********** ********** ********** **_*******



                            2451                                              2500
msal83564.2{147_COH1}       CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_M732}       CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_M781}       CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_2603}       CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_JM9130013}  CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_18RS21}     CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_090}        CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_CJB110}     CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_A909}       CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_H36B}       CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
msal83564.2{147_1169NT}     CAGCTCCTTT TGAAAGCAAC AACTATACTG CCTTGTTAAC ACAATCAGCG
Consensus                   ********** ********** ********** ********** **********



                            2501                                              2550
msal83564.2{147_COH1}       TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_M732}       TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_M781}       TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_2603}       TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_JM9130013}  TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_18RS21}     TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_090}        TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_CJB110}     TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_A909}       TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_H36B}       TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
msal83564.2{147_1169NT}     TCTTGGGGCT ATGTTGATTA TGTCAAAAAT GGTGGGGAGT TAGAATTAGC
Consensus                   ********** ********** ********** ********** **********



                            2551                                              2600
msal83564.2{147_COH1}       ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_M732}       ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_M781}       ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_2603}       ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_JM9130013}  ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_18RS21}     ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_090}        ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_CJB110}     ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_A909}       ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_H36B}       ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
msal83564.2{147_1169NT}     ACCGGAGAGT CCAAAAAGAA TTATTTTAGG AACTTTTGAG AATAAGGTTG
Consensus                   ********** ********** ********** ********** **********



                            2601                                              2650
msal83564.2{147_COH1}       AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_M732}       AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_M781}       AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_2603}       AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_JM9130013}  AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_18RS21}     AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_090}        AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_CJB110}     AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_A909}       AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_H36B}       AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
msal83564.2{147_1169NT}     AGGATAAAAC AATTCATCTT TTGGAAAGAG ATGCAGCGAA TAATCCATAT
Consensus                   ********** ********** ********** ********** **********



                            2651                                              2700
msal83564.2{147_COH1}       TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAcGAAA TCACTCCCCA
msal83564.2{147_M732}       TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAcGAAA TCACTCCCCA
msal83564.2{147_M781}       TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAcGAAA TCACTCCCCA
msal83564.2{147_2603}       TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAcGAAA TCACTCCCCA
msal83564.2{147_JM9130013}  TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAcGAAA TCACTCCCCA
msal83564.2{147_18RS21}     TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAcGAAA TCACTCCCCA
msal83564.2{147_090}        TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAtGAAA TCACTCCCCA
msal83564.2{147_CJB110}     TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAtGAAA TCACTCCCCA
msal83564.2{147_A909}       TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAtGAAA TCACTCCCCA
msal83564.2{147_H36B}       TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAtGAAA TCACTCCCCA
msal83564.2{147_1169NT}     TTTGCCATTT CTCCAAATAA AGATGGAAAT AGGGAtGAAA TCACTCCCCA
Consensus                   ********** ********** ********** *****_**** **********



                            2701                                              2750
msal83564.2{147_COH1}       GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_M732}       GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_M781}       GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_2603}       GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_JM9130013}  GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_18RS21}     GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_090}        GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_CJB110}     GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_A909}       GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_H36B}       GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
msal83564.2{147_1169NT}     GGCAACTTTC TTAAGAAATG TTAAGGATAT TTCTGCTCAA GTTCTAGATC
Consensus                   ********** ********** ********** ********** **********






WO 2004/018646 



PCT/US2003/026827 



Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD) 



                            2751                                              2800
msal83564.2{147_COH1}       AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_M732}       AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_M781}       AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_2603}       AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_JM9130013}  AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_18RS21}     AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_090}        AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_CJB110}     AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_A909}       AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_H36B}       AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
msal83564.2{147_1169NT}     AAAATGGAAA TGTTATTTGG CAAAGTAAGG TTTTACCATC TTATCGTAAA
Consensus                   ********** ********** ********** ********** **********



                            2801                                              2850
msal83564.2{147_COH1}       AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_M732}       AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_M781}       AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_2603}       AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_JM9130013}  AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_18RS21}     AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_090}        AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_CJB110}     AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_A909}       AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_H36B}       AATTTCCATA ATAATCCAAA GCAaAGTGAT GGTCATTATC GTATGGATGC
msal83564.2{147_1169NT}     AATTTCCATA ATAATCCAAA GCAgAGTGAT GGTCATTATC GTATGGATGC
Consensus                   ********** ********** ***_****** ********** **********



                            2851                                              2900
msal83564.2{147_COH1}       tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_M732}       tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_M781}       tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_2603}       tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_JM9130013}  tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_18RS21}     tcTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_090}        ctTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_CJB110}     ctTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_A909}       ccTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_H36B}       ccTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
msal83564.2{147_1169NT}     ccTTCAGTGG AGTGGTTTAG ATAAGGATGG CAAAGTTGTA GCAGATGGTT
Consensus                   __******** ********** ********** ********** **********



                            2901                                              2950
msal83564.2{147_COH1}       TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_M732}       TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_M781}       TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_2603}       TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_JM9130013}  TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_18RS21}     TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_090}        TTTATACTTA TCGccTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_CJB110}     TTTATACTTA TCGccTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_A909}       TTTATACTTA TCGttTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_H36B}       TTTATACTTA TCGttTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
msal83564.2{147_1169NT}     TTTATACTTA TCGctTACGT TACACACCAG TAGCAGAAGG AGCAAATAGT
Consensus                   ********** ***__***** ********** ********** **********



                            2951                                              3000
msal83564.2{147_COH1}       CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_M732}       CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_M781}       CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_2603}       CAGGAGTCAG ACTTTAAAGT aCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_JM9130013}  CAGGAGTCAG ACTTTAAAGT aCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_18RS21}     CAGGAGTCAG ACTTTAAAGT aCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_090}        CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_CJB110}     CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_A909}       CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_H36B}       CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
msal83564.2{147_1169NT}     CAGGAGTCAG ACTTTAAAGT tCAAGTAAGT ACTAAGTCAC CAAATCTTCC
Consensus                   ********** ********** _********* ********** **********



                            3001                                              3050
msal83564.2{147_COH1}       TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_M732}       TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_M781}       TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_2603}       TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_JM9130013}  TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_18RS21}     TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_090}        TTtACtAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_CJB110}     TTtACtAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_A909}       TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_H36B}       TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
msal83564.2{147_1169NT}     TTcACgAGCT CAGTTTGATG AAACTAATCG AACATTAAGC TTAGCCATGC
Consensus                   **_**_**** ********** ********** ********** **********



                            3051                                              3100
msal83564.2{147_COH1}       CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_M732}       CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_M781}       CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_2603}       CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_JM9130013}  CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_18RS21}     CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_090}        CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_CJB110}     CTAAGGaAAG TAGTTATGTT CCTAcATATC GTtTACAATT AGTTTTATCT
msal83564.2{147_A909}       CTAAGGaAAG TAGTTATGTT CCTAcATATC GTcTACAATT AGTTTTATCT
msal83564.2{147_H36B}       CTAAGGaAAG TAGTTATGTT CCTAcATATC GTcTACAATT AGTTTTATCT
msal83564.2{147_1169NT}     CTAAGGgAAG TAGTTATGTT CCTAtATATC GTcTACAATT AGTTTTATCT
Consensus                   ******_*** ********** ****_***** **_******* **********



                            3101                                              3150
msal83564.2{147_COH1}       CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_M732}       CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_M781}       CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_2603}       CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_JM9130013}  CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_18RS21}     CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_090}        CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_CJB110}     CATGTTGTAA AAGATGAAGA ATATGGgGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_A909}       CATGTTGTAA AAGATGAAGA ATATGGaGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_H36B}       CATGTTGTAA AAGATGAAGA ATATGGaGAT GAGACTTCTT ACcATTATTT
msal83564.2{147_1169NT}     CATGTTGTAA AAGATGAAGA ATATGGaGAT GAGACTTCTT ACtATTATTT
Consensus                   ********** ********** ******_*** ********** **_*******



                            3151                                              3200
msal83564.2{147_COH1}       CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_M732}       CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_M781}       CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_2603}       CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_JM9130013}  CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_18RS21}     CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_090}        CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_CJB110}     CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACg GTTAAGATAG
msal83564.2{147_A909}       CCATATAGAT CgAGAAGGTA AAGtGACACT TCCTAAAACa GTTAAGATAG
msal83564.2{147_H36B}       CCATATAGAT CaAGAAGGTA AAGtGACACT TCCTAAAACa GTTAAGATAG
msal83564.2{147_1169NT}     CCATATAGAT CaAGAAGGTA AAGcGACACT TCCTAAAACg GTTAAGATAG
Consensus                   ********** *_******** ***_****** *********_ **********



                            3201                                              3250
msal83564.2{147_COH1}       GAGAGAGTGA GGTTGCgGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_M732}       GAGAGAGTGA GGTTGCgGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_M781}       GAGAGAGTGA GGTTGCgGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_2603}       GAGAGAGTGA GGTTGCgGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_JM9130013}  GAGAGAGTGA GGTTGCgGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_18RS21}     GAGAGAGTGA GGTTGCgGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_090}        GAGAGAGTGA GGTTGCaGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_CJB110}     GAGAGAGTGA GGTTGCaGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
msal83564.2{147_A909}       GAGAGAGTGA GGTTGCaGTA GACCCTAAGa CCTTGACACT TGTTGTGGAA
msal83564.2{147_H36B}       GAGAGAGTGA GGTTGCaGTA GACCCTAAGa CCTTGACACT TGTTGTGGAA
msal83564.2{147_1169NT}     GAGAGAGTGA GGTTGCaGTA GACCCTAAGg CCTTGACACT TGTTGTGGAA
Consensus                   ********** ******_*** *********_ ********** **********



                            3251                                              3300
msal83564.2{147_COH1}       GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_M732}       GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_M781}       GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_2603}       GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAtC TCTTGAATAA
msal83564.2{147_JM9130013}  GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAtC TCTTGAATAA
msal83564.2{147_18RS21}     GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAtC TCTTGAATAA
msal83564.2{147_090}        GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_CJB110}     GATAAAGCTG GTAATTTtGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_A909}       GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_H36B}       GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
msal83564.2{147_1169NT}     GATAAAGCTG GTAATTTcGC AACGGTAAAA TTGTCTGAcC TCTTGAATAA
Consensus                   ********** *******_** ********** ********_* **********



                            3301                                              3350
msal83564.2{147_COH1}       GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_M732}       GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_M781}       GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_2603}       GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_JM9130013}  GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_18RS21}     GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_090}        GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_CJB110}     GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
msal83564.2{147_A909}       GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAaTTTCA
msal83564.2{147_H36B}       GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAaTTTCA
msal83564.2{147_1169NT}     GGCAGTAGTA TCAGAGAAAG AAAACGCTAT AGTAATTTCT AACAgTTTCA
Consensus                   ********** ********** ********** ********** ****_*****



                            3351                                              3400
msal83564.2{147_COH1}       AATATTTTGA TAACTTGAAg AAAGAAcCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_M732}       AATATTTTGA TAACTTGAAg AAAGAAcCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_M781}       AATATTTTGA TAACTTGAAg AAAGAAcCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_2603}       AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAaAAGaA
msal83564.2{147_JM9130013}  AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAaAAGaA
msal83564.2{147_18RS21}     AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAaAAGaA
msal83564.2{147_090}        AATATTTTGA TAACTTGAAa AAAGAAtCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_CJB110}     AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_A909}       AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_H36B}       AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAgAAGgA
msal83564.2{147_1169NT}     AATATTTTGA TAACTTGAAa AAAGAAcCTA TGTTTATTTC TAAAaAAGaA
Consensus                   ********** *********_ ******_*** ********** ****_***_*



                            3401                                              3450
msal83564.2{147_COH1}       AAAGTAGTAA ACAAGAATCT AGAAGAAATA acATTAGTTA AGCCtCAaAC
msal83564.2{147_M732}       AAAGTAGTAA ACAAGAATCT AGAAGAAATA acATTAGTTA AGCCtCAaAC
msal83564.2{147_M781}       AAAGTAGTAA ACAAGAATCT AGAAGAAATA acATTAGTTA AGCCtCAaAC
msal83564.2{147_2603}       AAAGTAGTAA ACAAGAATCT AGAAGAAATA atATTAGTTA AGCCgCAaAC
msal83564.2{147_JM9130013}  AAAGTAGTAA ACAAGAATCT AGAAGAAATA atATTAGTTA AGCCgCAaAC
msal83564.2{147_18RS21}     AAAGTAGTAA ACAAGAATCT AGAAGAAATA atATTAGTTA AGCCgCAaAC
msal83564.2{147_090}        AAAGTAGTAA ACAAGAATCT AGAAGAAATA acATTAGTTA AGCCgCAaAC
msal83564.2{147_CJB110}     AAAGTAGTAA ACAAGAATCT AGAAGAAATA acATTAGTTA AGCCgCAaAC
msal83564.2{147_A909}       AAAGTAGTAA ACAAGAATCT AGAAGAAATA gcATTAGTTA AGCCgCAaAC
msal83564.2{147_H36B}       AAAGTAGTAA ACAAGAATCT AGAAGAAATA gcATTAGTTA AGCCgCAaAC
msal83564.2{147_1169NT}     AAAGTAGTAA ACAAGAATCT AGAAGAAATA atATTAGTTA AGCCgCAcAC
Consensus                   ********** ********** ********** __******** ****_**_**



                            3451                                              3500
msal83564.2{147_COH1}       TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_M732}       TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_M781}       TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_2603}       TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_JM9130013}  TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_18RS21}     TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_090}        TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_CJB110}     TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_A909}       TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
msal83564.2{147_H36B}       TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTcAA TCAGGAAATG
msal83564.2{147_1169NT}     TACAGTTACT ACTCAATCAT TGTCTAAAGA AATAACTaAA TCAGGAAATG
Consensus                   ********** ********** ********** *******_** **********



                            3501                                              3550
msal83564.2{147_COH1}       AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_M732}       AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_M781}       AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_2603}       AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_JM9130013}  AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_18RS21}     AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_090}        AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_CJB110}     AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_A909}       AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_H36B}       AGAAAGTCCT CACTTCTACA AACAATAATA GTAGcAGAGT AGCTAAgATC
msal83564.2{147_1169NT}     AGAAAGTCCT CACTTCTACA AACAATAATA GTAGtAGAGT AGCTAAaATC
Consensus                   ********** ********** ********** ****_***** ******_***



                            3551                                              3600
msal83564.2{147_COH1}       ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_M732}       ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_M781}       ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_2603}       ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACCt tacctagtac
msal83564.2{147_JM9130013}  ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_18RS21}     ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_090}        ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_CJB110}     ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_A909}       ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_H36B}       ATATCACCTA AACATAAcGG GGATTCTGTT AACCATACC-
msal83564.2{147_1169NT}     ATATCACCTA AACATAAtGG GGATTCTGTT AACCATACC-
Consensus                   ********** *******_** ********** ********** **********



                            3601                                              3650
msal83564.2{147_COH1}
msal83564.2{147_M732}
msal83564.2{147_M781}
msal83564.2{147_2603}       atcagataga gcaacgaatg gtctatttgt tggtactttg gcattgttat
msal83564.2{147_JM9130013}
msal83564.2{147_18RS21}
msal83564.2{147_090}
msal83564.2{147_CJB110}
msal83564.2{147_A909}
msal83564.2{147_H36B}
msal83564.2{147_1169NT}
Consensus                   ********** ********** ********** ********** **********

                            3651                                              3700
msal83564.2{147_COH1}
msal83564.2{147_M732}
msal83564.2{147_M781}
msal83564.2{147_2603}       ctagtttact tctttatttg aaacccaaaa agactaaaaa taatagtaaa
msal83564.2{147_JM9130013}
msal83564.2{147_18RS21}
msal83564.2{147_090}
msal83564.2{147_CJB110}
msal83564.2{147_A909}
msal83564.2{147_H36B}
msal83564.2{147_1169NT}
Consensus                   ********** ********** ********** ********** **********

SEQ ID NO. 4412 
STRAIN 2603 

VDKHHSKKAI IiKLTL ITTS I LLMHSNQVNAEEQELKNQEQSPVI ANVAQQPS PSVTTNTV 
EKTSVTAASASNTAKEMGDTSVKNDKTEDELLEELSKNLDTSNLGADLEEEYPSKPETTN 
NKESNVVTNASTAIAQKVPSAYEEVKPESKSSLAVLDTSKITKLQAITQRGKGNWAI I D 
TGFD I NHD I FRLDS PKDDKHS FKTKTE FEELKAKHN I TYGKWVNDKI VFAHNYANNTETV 
ADIAAAMKDGYGSEAKNISHGTHVAGI FVGNSKRPAI NGLLLEGAAPNAQVLLMRI PDKI 
DS D KFGEAY AKA I TD AVNLGAKT I NM S I G KTADS L IAliNDKVKLALKLASEKGVAVWAA 
GNEGAFGMDYSKPLSTNPDYGTVNS PAI SEDTLS VAS YESLKTI SEWETT I EGKLVKLP 
IVTSKPFDKGKAYDVVYANYGAKKDFEGKDFKGKIALIERGGGLDFMTKITHATNAGVVG 
I VI PNDQEKRGNFIi I P YRELPVG 1 1 SKVDGER I KNTS SQLTFNQS FEWDSQGGNRMLEQ 
SSWGVTAEGAIKPDWASGFEIYSSTYNNQYQTMSGTSMASPHVAGL^^^MLQSHIIAEKYK 
GMNLDSKKLLELSKN I LMS S ATAL YSEEDKAF YS PRQQGAGVVDAEKAI QAQYY I TGNDG 
KAKI NLKRMGDKFD I TVT I HKLVEG VKELY YQANVATEQVNKGKFALKPQALLDTNWQKV 
I LRDKETQVRFT I DASQFSQKLKEQMANGY FLEGFVRFKEAKDSNQELMS I P FVG FNGDF 
ANLQALETPIYKTLSKGSFYYKPNDTTHKDQLEYNESAPFESNNYTALLTQSASWGYVDY 
VKNGGELELAPESPKRI I LGTFENKVEDKT I HLLERDAANNPYFAI SPNKDGNRDE ITPQ 
AT FLRNVKD I SAQVLDQNGNVI WQSKVLPS YRKNFHNNPKQSDGHYRMDALQWSGLDKDG 
KVVADGFYTYRLRYTPVAEGANSQESDFKVQVSTKSPNLPSRAQFDETNRTLSLAMPKES 
S YVPTYRLQLVLSHVVKDEE YGDETS YHYFH I DQEGKVTL PKTVKI GESE VAVDPKALTL 
VVEDKAGNFATVKLSDLLNKAVVS EKENA I VI SNSFKYFDNLKKEPMFI SKKEKWNKNL 
EEI ILVKPQTTVTTQSLSKEITKSGNEKVLTSTNNNSSRVAKI ISPKHNGDSVNHTLPST 
SDRATNGLF VGTLALLS SLLLYLKPKKTKNNS K 



SEQ ID NO. 4413 
STRAIN A909 

EEQELKNQEQSP VI ANVAQQPS PS VTTOTVEKTSVTS AS ASOTAKEMGDTSVKl^KTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVBCPESK 
SSLAVLDTSKITKLQAITQRGKGNWAI IDTGFDINHDI FRLDS PKDDKHSFKTKAEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVAD I AAAMKDGYGSEAKNI SHGTHVAGI FVG 
NSKRPA I NGLLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKA ITDAVNLGAKT I NMSLGK 
TADSLIALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETT I EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKRL . R . G 
L . R . DCIN . AWWWT . FYD . NHSCYKCRCCWYRYF . RSRKTWKFSNSLP . ITCGGY . . SRW 
RAYKKYFKSVNI . PEF . SS . . PRWQSYAGTI KLGRDS . RSNQA . CNSFWL .NLFFNL . . S 
IPNNVWYKYGFTTCCRINDNASKSFG . EI - RDEFRF . KIARIV. KHPHELSNSI I . . RG . 
GVLFTTSARCRCS . C . KSYPSSILCYWKRWQS .N . SQTSGR. I . YHSYNS .TCRRCQRIV 
LSS . CSNRTSK.R. I CP . TTSLARY . LAESNSS . . RNTSSIYY. F . SI . S E I KRTDGKWL 
FLRRFCTF . RSQG . . SGVNEYSFCRI . W . FCELTST . NTDL . DAF . R . FLL . TK . YNS . R 
PIGVQ . ISSF . KQQL YCLVNT I SVLGLC . LCQKWWGVR I STGES KKNYFRNF . E . G . G . N 
NSSFGKRCSE . SI FCHFSK . RWK . G . NHSPGNFLKKC . GYFCSSSRSKWKCYLAK . GFTI 
LS - KFP . . SKAK . WSLSYGCPSVEWFR . GWQSCSRWFLYLSFTLHTSSRRSK . SGVRL . S 
SSKY . VTKSSFTSSV . . N . SNI KLSHA . GK . LCSYI SSTI SFISCCKR . RI WR . DFLPLF 
PYRSRR . SDTS . NS . DRRE . GCSRP . DLDTCCGR . SW . FRNGKIV. PLE . GSSIRERKRY 
SNF . QFQIF . . LEKRTYVY F . RRKSSKQESRRNSIS . AANYSYYSI IV . RNNSIRK. ESP 
HFYKQ . . . QSS . DHIT . T . RGFC . PY 

SEQ ID NO. 4414 
STRAIN H36B 

EEQELKNQEQSPVIANVAQQPSPSVTTNTVEKTSVTSASASNTAKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVLDTSKITKLQAITQRGKGNWAI IDTGFDINHDI FRLDS PKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTETVAD I AAAMKDGYGSEAKNI SHGTHVAGI FVG 
NS KRPAINGLLLEGAAPNAQ VLLMR I PDKI DSDKFGEAYAKAI TDAVNLGAKT I NMS LGK 
TADSLIALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLDFMTKI THATNAG WG IV I FNDQEKRGNFL I PYRELPVGVI SKVDG 
ERI KNTS SQLTFNQS FEWDSQGGNRMLEQS S WGVTAEGA I KPDVTASGFEI YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYS PRQQGAGWDAEKAI QAQYYVTGNDGKAKI NLKRVGDKFD I TVT I HKLVEG VKELY 
YQANVATEQVNKGKFALKPQALLDTNWQKVILRDKETQVRFTIDSSQFSQKLKEQMANGY 
FLEGFVRFKEAKDSNQELMS I PFVG FNGD FANLQALETP I YKTLSKG S FYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRI I LGTFENKVEDKT 
IHLLERDAANNPYFAI SPNKDGNRDEITPQATFLRNVKDI SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDALQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPSRAQFDETNRTLSLAMPKESSYVPTYRLQLVLSHWKDEEYGDETSYHYF 
HIIX^EGKVTLPKTVKIGESEVAVDPKTLTLVVEDKAGNFATVKLSDLLNKAVVSEKENAI 
VISNNFKYFDNLKKEPMFISKEGKVVNKNLEEIALVKPQTTVTTG^^ 
TSTNNNSSRVAKI I SPKHNGDS VNHT 

SEQ ID NO. 4415 
STRAIN 18RS21 

EEQELKNQEQSPVIANVAQQPSPSVTTNTVEKTSVTAASASOTAKBMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVLDTSKITKLQAITQRGKGNWAI IDTGFD INHDI FRLDS PKDDKHSFKTKTEFEE 
LKAKHNITYGKWVNDKIVFAHNYANNTETVADIAAAMK^ 

NSKRPAINGLLLEGAAPNAQVLLMRIPDKIDSDKFGEAYAKAITDAVNLGAKTINMSIGK 
TADSLI AI^KVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNS PAI SE 
DTLSVASYESLKTI SEWETTI EG KL VKL P I VT S KP F D KG KA YD WYANYGA KKD FEGKD 
FKGKI AL I ERGGGLDFMTKI THATNAG WG I VI FNDQE KRGNFL I PYRELPVG 1 1 SKVDG 
ERI KNTSSQLTFNQSFEWDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE X YSSTYNNQ 
YG^SGTSMASPHVAGI^MLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYS PRQQGAGWDAEKAI QAQYY I TGNDGKAKI NLKRMGDKFD I TVT I HKLVEGVKELY 
YQANVATEQVNKGKFALKPQALLDTNWQKV I LRD KETQVRFT I DASQFSQKLKEQMANGY 
FLEGFVRFKEAKDSNQELMS I PFVGFNGD FANLQALETP I YKTI SKGS FYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRIILGTFENKVEDKT 
I HLLERDAANNP YFA I S PNKDGNRDE ITPQAT FLfRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDALQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPSRAQFDETNRTLSIjAMPKESSYVPTYRLQLVLSHVVKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLVVEDKAGNFAWKliSDLI^KAWSEKENAI 
VI SNSFKYFDNLKKEPMFI SKKEKWNKNLEE I ILVKPQTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI I SPKHNGDS VNHT 

SEQ ID NO. 4416 
STRAIN M732 

EEQELKNQEQSPVIAWAQQPSPSVTTNIVEKTSVTAASASNTVKEMGDTSVKNDKTEDE 
I^EELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKSESK 
SSIAVLDTSKITKLQATTQRGKGNVVAIIDTGFDINHDIFRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYGSEAKN I LHGTHVAGI FVG 
NSKRPAI NSLLLEGAAPNAQ VLLMR I PDKI DS DKFGEAYAKA 1 1 DAVNLGAKT I NMSLGK 
TADSLIALNDKVKLALK1ASEKGVAWVAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKI LKVRT 
LKVRLH . LSVWDLI L . LKSLMLQMQVLLVSLFLTI KKNVEI F . FLTVNYLWGLLVK . MA 
SV . KILQVS . HLTRVLK . LIAKVAI VCWNNQVGA . QLKEQSSLM . QLLALKF I LQP 1 1 IN 
TKQCLVQVWLHHMLQD . . QCFKVIWLRNIKG . I . ILKNC . NCLKTSS . AQQQHYI VKRIR 
RFIHHVSKVQV. LMLKKLSKLNIMLLETMAKLKLISNEREINLISQLQFINL . KVSKNCI 
I KLM . QQNK . IKVNLPLNHKPC . ILIGRK . FFVI KKHKFDLLLMLVNLVRN . KNRWQMVI 
S . KVLYVLKKPRI VI RS . . VFLL . DLMVI LRTYKHLKHRF I RRFLKWSTINQM I QLI KT 
NWSTMNQLLLKATT ILPC . HNQRLGAMLIMSKMVGS . N . HRRVQKELF . ELLRI RLRI KQ 
FIFWKEMQRIIHILPFLQIKMEIGTKSLPRQLS . EMLRIFLLKF . I KMEMLFGKVRFYHL 
I VKI S 1 1 1 QSKVMVI I VWMLFSG W . I RMAKL . QMVF I L I AYVTHQ . QKEQI VRSQTLKF 
K. VLSHQIFLHELSLMKLIEH . A. PCLRKWMFLHIVYN . FYLML . KMKNMGMRLLTI IS 
I . I KKVK . HFLKRLR . ERVRLR - TLRP . HLLWKI KLVILQR . NCLTS . IRQ. YQRKKTL . 
. FLTVSNILIT . RKNLCLFLKKEK . . TRI . KK . H . LSLKLQLLLNHCLKK . LNQEMRKSS 
LLQTIIVAE . LRSYHLNITGILLTI 

SEQ ID NO. 4417 
STRAIN COH1 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTVK^GDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKSESK 
SSLAVLDTSKITKLQATTQRGKGNWAI IDTGFD INHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYGSEAKN I LHGTHVAG I FVG 
NSKRPAINSLLLEGAAPNAQVLIjMRI PDKI DSDKFGEAYAKA 1 1 DAVNLGAKT I NMSLGK 
TADSLIALNDKVKIALKIASEKGVAWVAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVAS YESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKI LKVRT 
LKVRLH . LSVWDLIL . LKSLMLQMQVLLVSL FLT I KKNVE I F . FLTVNYLWGLLVK . MA 
SV . KILQVS . HLTRVLK. LIAKVAI VCWNNQVGA . QLKEQSSLM . QLLALKF I LQP 1 1 IN 
TKQCLVQVWLHHMLQD. . QCFKVIWLRNIKG, I . ILKNC .NCLKTSS .AQQQHYI VKRIR 
RF IHHVSKVQV . LMLKKLS KLNIMLLETMAKLKL I SNERE INLI SQLQF INL . KVSKNC I 
I KLM . QQNK . IKVNLPLNHKPC . ILIGRK. FFVI KKHKFDLLLMLVNLVRN . KNRWQMVI 
S . KVLYVLKKPRI VIRS . .VFLL . DLMVI LRTYKHLKHRF I RRFLKWSTI NQM I QL I KT 
NWSTMNQLLLKATT ILPC . HNQRLGAMLIMSKMVGS . N . HRRVQKELF . ELLRI RLRI KQ 
FIFWKEMQRIIHILPFLQIKMEIGTKSLPRQLS .EMLRIFLLKF. I KMEMLFGKVRFYHL 
I VKI S 1 1 IQSKVMVI I VWMLFSGW . I RMAKL . QMVF I L I AYVTHQ . QKEQI VRSQTLKF 
K. VLSHQIFLHELSLMKLIEH .A . PCLRKWMFLH I VYN . FYLML . KMKNMGMRLLTI IS 
I . I KKVK . HFLKRLR . ERVRLR . TLRP . HLLWKI KLVILQR . NCLTS . IRQ. YQRKKTL . 
. FLTVSNILIT . RKNLCLFLKKEK . . TRI . KK . H . LSLKLQLLLNHCLKK . LNQEMRKSS 
LLQTI I VAE . LRSYHLNITGILLTI 

SEQ ID NO. 4418 
STRAIN M781 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTVKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNWTNASTAIAQKVPSAYEEVKSESK 
SSLAVLDTSKITKLQATTQRGKGNVVAI IDTGFD INHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTETVAD I AAAMKDGYGSEAKN I LHGTHVAG I FVG 
NSKRPAI NS LLLEGAAPNAQ VLLMR I PDKIDSDKFGEAYAKAI I DAVNLGAKT I NMSLGK 
TADSLIALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKI LKVRT 
LKVRLH . LS VWDL I L . LKSLMLQMQVLLVSL FLT I KKNVEI F . FLTVNYLWGLLVK . MA 
SV . KILQVS . HLTRVLK. LIAKVAI VCWNNQVGA . QLKEQSSLM . QLLALKFILQPI I IN 
TKQCLVQVWLHHMLQD . . QCFKVIWLRNIKG. I . ILKNC .NCLKTSS . AQQQHYI VKRIR 
RFIHHVSKVQV.I^LKKLSKLNIMLLETMAKLKIilSNEREINIiISQLQFINL . KVSKNCI 
IKLM . QQNK. IKVNLPLNHKPC . ILIGRK. FFVI KKHKFDLLLMLVNLVRN . KNRWQMVI 
S . KVLYVLKKPRIVIRS . . VFLL . DLMVI LRTYKHLKHRF I RRFLKWST INQM I QL I KT 
NWSTMNQLLLKATTILPC.HNQRLGAMLIM8KMVGS . N . HRRVQKEL F . ELLRIRLRIKQ 
F I FWKEMQRI I HI LP FLQ I KME I GTKSLPRQLS . EMLRI FLLKF . I KMEMLFGKVRFYHL 
IVKISI I IQSKVMVI IVWMLFSGW . IRMAKL .QMVFILIAYVTHQ . QKEQI VRSQTLKF 
K. VLSHQIFLHELSLMKLIEH. A. PCLRKWMFLHIVYN . FYLML . KMKNMGMRLLTI IS 
I . I KKVK . HFLKRLR . ERVRLR . TLRP . HLLWKI KLVILQR . NCLTS . IRQ. YQRKKTL . 
. FLTVSNILIT . RKNLCLFLKKEK . TRI . KK . H . LSLKLQLLLNHCLKK. LNQEMRKSS 
LLQTIIVAE . LRSYHLNITGILLTI 

SEQ ID NO. 4419 
STRAIN JM9130013 

EEQELKNQEQSPVIANVAC^PSPSVTTNTVEKTSVTAASASNTAKEMGDTSVKNDKTED 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVLDTS KITKLQAI TQRGKGNWAI I DTGFD I NHDI FRLDS PKDDKHS FKTKTEFEE 
LKAKHNI TYGKWVNDKI VFAHNYANNTETVAD I AAAMKDGYGS EAKN I SHGTHVAG I FVG 
NSKRPAINGLLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKA I TDAVNLGAKT I NMS IGK 
TADSLIALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPD^ 
DTLSVASYESLKTISEVVETTIEGKLVKTjPIVTSKPFDKGKAYDVVYANYGAKKDFEGKD 
FKGKI AL I ERGGGLDFMTK I THATNAG WG I VI FNDQEKRGNFL I PYRELP VG 1 1 SKVDG 
ERI KNTS SQLTFNQSFEWDSQGGNRMLEQSS WGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQ YY I TGNDGKAKI NLKRMGDKFD I TVT I HKL VEGVKEL Y 
YQANVATEQVNKGKFALKPQALLDTNWQKVILRDK^QVRFTIDASQFSQKLREQMANGY 
FLEGFVRFKEAKDSNQELMS I PFVGFNGDFANLQALETPI YKTLSKGSFYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELELAPESPKRIILiGTFENKVEDKT 
I HLLERDAANNP YFAI S PNKDGNRDE I TPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDALQWSGLDKDGKWADGFYTYRLR 

QVSTKSPNLPSRAQFDETNRTLSLAMPKESSYVPTYRLQLVLSHVVKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLVVEDKAGNFATVK^ 

VI SNSFKYFDNLKKEPMFI SKKEKWNKNLEE 1 1 LVKPQTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI I S PKHNGDS VNHT 

SEQ ID NO. 4420 
STRAIN 090 

EEQELKNQEQSPVIANVAO^PSPSVTTNIVEKTSVTAASASNTVKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVFDTSKITKLQAITQRGKGNVVAIIDTGFDINHDIFRLDSPKDDKHSFKTKAEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVAD IAAAMKDGYGSEAKNI SHGTHVAGI FVG 
NS KRPAI NGLLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKAI TDAVNLGAKT INMSLGK 
TADSLIAIOTKVKI^LKLASEKGVAVVVAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLDFMTK I THATNAG WG I VI FNDQEKRGNFLI PYRELPVGVI SKVDG 
ERIKNTSSQLTFNQSFEWDSGGGNRMLEQSSWGVTAEGAIKPDVTASGFEI YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHIiAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQQGAGVVDAEKAIQAQYYVTGNDGKAKINLKRVGDKFDITVTIHKLVEGVKELY 
YQANVATEQVNKGKFALKPQALLDTNWQKVI LRDKETQVRFT I DASQFSQKLKEQMANG Y 
FLEGFVRFKEAKDSNQELMS I PFVGFNGDFANLQALETPI YKTLSKGSFYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELEIoAPESPKRIILGTFENKVEDKT 
I HLLERDAANNPYFAI S PNKDGNRDE I TPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFBNNPKQSDGHYRMDAFQWSGLDKDGKVVADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPLLAQFDETNRTLSLAMPKESSYVPTYRLQLVLSHVVKDEEYGDETSYHYF 
HIDQEGKVTLPKTVKIGESEVAVDPKALTLVVEDKAGNFATVK^ 

VI SNS FKYFDNLKKESMF I SKEGKWNKNLEE ITLVKPQTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI IS PKHNGDS VNHT 

SEQ ID NO. 4421 
STRAIN CJB110 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTAKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNLGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPESK 
SSLAVFDTS KITKLQAI TQRGKGNWAI I DTGFD I NHDI FRLDSPKDDKHSFKTKAEFEE 
LKAKHNITYGKWVNDKI VFAHNYANNTETVAD IAAAMKDGYGSEAKNI SHGTHVAGI FVG 
NS KRPAI NGLLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKAI TDAVNLGAKT I NMSLGK 
TADSLIALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNSPAISE 
DTLSVASYESLKTI SEWETTI EGKLVKLP I VTSKPFDKGKAYDWYANYGAKKDFEGKD 
FKGKI AL I ERGGGLD FMTKI THATNAGWG I VI FNDQEKRGNFLI PYRELPVGVI SKVDG 
ERI KNTSSQLTFNQS FEWDSQGGNRMLEQSS WGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQ^SGTSMASPHVAGLMTMLQNHLAEKYKGMNLDSKKLLELSKNILMSSATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQYYVTGNDGKAKINLKRVGDKFDITVTI HKL VEGVKEL Y 
YQANVATEQVNKGKFALKPQALLDTNWQKVI LRDKETQVRFT I DASQFSQKLKEQMANG Y 
FLEGFVRFKEAKDSNQELMS I PFVGFNGDFANLQALETP I YKTLSKGSFYYKPNDTTHKD 
QLEYNES AP FESNNYTALLTQS ASWGYVDYVKNGGELELAPESPKR I- 1 LGTFENKVEDKT 
I HLLERDAANNPYFAI S PNKDGNRDE I TPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
YRKNFHNNPKQSDGHYRMDAFQWSGLDKDGKWADGFYTYRLRYTPVAEGANSQESDFKV 
QVSTKSPNLPLLAQFDETNRTLSLAMPKESSWPTYRLQLVLSHWKDEEYGDETSYHYF 
H I DQEGKVTL PKTVKI GES E VAVD PKALTLWEDKAGNFATVKLSDLLNKAWSEKENA I 
VI SNSFKYFDNLKKESMFI SKEGKWNKNLEEI TLVKPQTTVTTQSLSKE I TKSGNEKVL 
TSTNNNSSRVAKI I S PKHNGDS VNHT 

SEQ ID NO. 4422 
Table 44: Comparative Sequences relating to SAG0416 (strain info highlighted in BOLD)



STRAIN 1169NT 

EEQELKNQEQSPVIANVAQQPSPSVTTNIVEKTSVTAASASNTAKEMGDTSVKNDKTEDE 
LLEELSKNLDTSNMGADLEEEYPSKPETTNNKESNVVTNASTAIAQKVPSAYEEVKPKSK 
SSIAVLDTSKITKLQAITQRGKGNWAIIDTGFDINHDIFRLDSPKDDKHSFKNKAEFEE 
LKAKHN I TYGKWVNDKI VFAHNYANNTET VAD I AAAMKDGYG SEAKN I SHGTHVAG I FVG 
NSKRPAINGIiLLEGAAPNAQVLLMRI PDKI DSDKFGEAYAKAI TDAVNLGAKT I NMS IGK 
TADSL IALNDKVKLALKLASEKGVAVWAAGNEGAFGMDYSKPLSTNPDYGTVNS PAISE 
DTLSVASYESLKTISEVVETTIEGKLVKLPIVTSKPFDKGKAYDVVYANYGAKKDFEGKD 
FKGKI AL I ERGGGLD FMTKI THATNAG WG I VI FNDQEKRGNFL I PYRELPVGVI SKVDG 
ERIKNTSSQLTFNQRFEWDSQGGNRMLEQSSWGVTAEGAI KPDVTASGFE I YSSTYNNQ 
YQTMSGTSMASPHVAGLMTMLQSHLAEKYKGMNLDSKKLLELSKNILMS SATALYSEEDK 
AFYSPRQQGAGWDAEKAI QAQYYVTGNDGKAKINLKRVGDKFD I TVT I HKL VEGVKELY 
YQANVATEQVNKGKFALKPQALLDTNWQKVILRDKETQVRFTIDASQFSQKIjKEQMANGY 
FLEGFVRFKEAKDSNQELMS I P FVGFNGD FAS LQALETP I YKTLS KGS FYYKPNDTTHKD 
QLEYNESAPFESNNYTALLTQSASWGYVDYVKNGGELEI^PESPKRIILGTFENKVEDKT 
IHLLERDAANNPYFAI S PNKDGNRDE ITPQATFLRNVKD I SAQVLDQNGNVI WQSKVLPS 
VRKNFHNNPKQSDGHYRMDALQWSGIjDKDGKWADGFYTYRLRYTPVAEGANSQESDFlW 
QVSTKSPNLPSRAQFDETNRTLSIAMPKGSSYVPIYRLQLVLSHVVKDEEYGDETSYYYF 
H I DQEGKATLPKTVKI GESE VAVDPKALTL VVEDKAGNFATVKLS DLLNKAWS EKENA I 
VI SNSFKYFDNLKKEPMFI SKKEKWNKNLEEI ILVKPHTTVTTQSLSKEITKSGNEKVL 
TSTNNNSSRVAKI I S PKHNGD S VNHT 



PRETTY of: /biotmp/msa209368.2{*}  February 10, 2003 02:09

                            1                                                   50
msa209368.2{147_COH1}                                        EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_M732}                                        EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_M781}                                        EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_18RS21}                                      EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_2603}       vdkhhskkai lkltlittsi llmhsnqvna EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_JM9130013}                                   EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_090}                                         EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_CJB110}                                      EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_1169NT}                                      EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_H36B}                                        EEQELKNQEQ SPVIANVAQQ
msa209368.2{147_A909}                                        EEQELKNQEQ SPVIANVAQQ
Consensus                   ********** ********** ********** ********** **********



                            51                                                 100
msa209368.2{147_COH1}       PSPSVTTNiV EKTSVTaASA SNTvKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_M732}       PSPSVTTNiV EKTSVTaASA SNTvKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_M781}       PSPSVTTNiV EKTSVTaASA SNTvKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_18RS21}     PSPSVTTNtV EKTSVTaASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_2603}       PSPSVTTNtV EKTSVTaASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_JM9130013}  PSPSVTTNtV EKTSVTaASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_090}        PSPSVTTNiV EKTSVTaASA SNTvKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_CJB110}     PSPSVTTNiV EKTSVTaASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_1169NT}     PSPSVTTNiV EKTSVTaASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_H36B}       PSPSVTTNtV EKTSVTsASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
msa209368.2{147_A909}       PSPSVTTNtV EKTSVTsASA SNTaKEMGDT SVKNDKTEDE LLEELSKNLD
Consensus                   ********-* ******-*** ***-****** ********** **********



                            101                                                150
msa209368.2{147_COH1}       TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKseSK
msa209368.2{147_M732}       TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKseSK
msa209368.2{147_M781}       TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKseSK
msa209368.2{147_18RS21}     TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
msa209368.2{147_2603}       TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
msa209368.2{147_JM9130013}  TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
msa209368.2{147_090}        TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
msa209368.2{147_CJB110}     TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
msa209368.2{147_1169NT}     TSNmGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpkSK
msa209368.2{147_H36B}       TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
msa209368.2{147_A909}       TSNlGADLEE EYPSKPETTN NKESNVVTNA STAIAQKVPS AYEEVKpeSK
Consensus                   ***-****** ********** ********** ********** ******--**



                            151                                                200
msa209368.2{147_COH1}       SSLAVlDTSK ITKLQAtTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_M732}       SSLAVlDTSK ITKLQAtTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_M781}       SSLAVlDTSK ITKLQAtTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_18RS21}     SSLAVlDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_2603}       SSLAVlDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_JM9130013}  SSLAVlDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_090}        SSLAVfDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_CJB110}     SSLAVfDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_1169NT}     SSLAVlDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_H36B}       SSLAVlDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
msa209368.2{147_A909}       SSLAVlDTSK ITKLQAiTQR GKGNVVAIID TGFDINHDIF RLDSPKDDKH
Consensus                   *****-**** ******-*** ********** ********** **********



                            201                                                250
msa209368.2{147_COH1}       SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_M732}       SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_M781}       SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_18RS21}     SFKtKtEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_2603}       SFKtKtEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_JM9130013}  SFKtKtEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_090}        SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_CJB110}     SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_1169NT}     SFKnKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_H36B}       SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
msa209368.2{147_A909}       SFKtKaEFEE LKAKHNITYG KWVNDKIVFA HNYANNTETV ADIAAAMKDG
Consensus                   ***-*-**** ********** ********** ********** **********



                            251                                                300
msa209368.2{147_COH1}       YGSEAKNIlH GTHVAGIFVG NSKRPAINsL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_M732}       YGSEAKNIlH GTHVAGIFVG NSKRPAINsL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_M781}       YGSEAKNIlH GTHVAGIFVG NSKRPAINsL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_18RS21}     YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_2603}       YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_JM9130013}  YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_090}        YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_CJB110}     YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_1169NT}     YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_H36B}       YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
msa209368.2{147_A909}       YGSEAKNIsH GTHVAGIFVG NSKRPAINgL LLEGAAPNAQ VLLMRIPDKI
Consensus                   ********-* ********** ********-* ********** **********



                            301                                                350
msa209368.2{147_COH1}       DSDKFGEAYA KAIiDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
msa209368.2{147_M732}       DSDKFGEAYA KAIiDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
msa209368.2{147_M781}       DSDKFGEAYA KAIiDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
msa209368.2{147_18RS21}     DSDKFGEAYA KAItDAVNLG AKTINMSiGK TADSLIALND KVKLALKLAS
msa209368.2{147_2603}       DSDKFGEAYA KAItDAVNLG AKTINMSiGK TADSLIALND KVKLALKLAS
msa209368.2{147_JM9130013}  DSDKFGEAYA KAItDAVNLG AKTINMSiGK TADSLIALND KVKLALKLAS
msa209368.2{147_090}        DSDKFGEAYA KAItDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
msa209368.2{147_CJB110}     DSDKFGEAYA KAItDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
msa209368.2{147_1169NT}     DSDKFGEAYA KAItDAVNLG AKTINMSiGK TADSLIALND KVKLALKLAS
msa209368.2{147_H36B}       DSDKFGEAYA KAItDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
msa209368.2{147_A909}       DSDKFGEAYA KAItDAVNLG AKTINMSlGK TADSLIALND KVKLALKLAS
Consensus                   ********** ***-****** *******-** ********** **********



                            351                                                400
msa209368.2{147_COH1}       EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_M732}       EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_M781}       EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_18RS21}     EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_2603}       EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_JM9130013}  EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_090}        EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_CJB110}     EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_1169NT}     EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_H36B}       EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
msa209368.2{147_A909}       EKGVAVVVAA GNEGAFGMDY SKPLSTNPDY GTVNSPAISE DTLSVASYES
Consensus                   ********** ********** ********** ********** **********



                            401                                                450
msa209368.2{147_COH1}       LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKilkvrt
msa209368.2{147_M732}       LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKilkvrt
msa209368.2{147_M781}       LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKilkvrt
msa209368.2{147_18RS21}     LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_2603}       LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_JM9130013}  LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_090}        LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_CJB110}     LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_1169NT}     LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_H36B}       LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKdfegkd
msa209368.2{147_A909}       LKTISEVVET TIEGKLVKLP IVTSKPFDKG KAYDVVYANY GAKKrl.r.g
Consensus                   ********** ********** ********** ********** ****



                            451                                                500
msa209368.2{147_COH1}       lkvrlh.lsv wdlil.lks lmlqmqvllv slfltikknv eiF.fltvny
msa209368.2{147_M732}       lkvrlh.lsv wdlil.lks lmlqmqvllv slfltikknv eiF.fltvny
msa209368.2{147_M781}       lkvrlh.lsv wdlil.lks lmlqmqvllv slfltikknv eiF.fltvny
msa209368.2{147_18RS21}     fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_2603}       fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_JM9130013}  fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_090}        fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_CJB110}     fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_1169NT}     fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_H36B}       fkgkialier gggldfmtki thatnagwg ivifndqekr gnFlipyrel
msa209368.2{147_A909}       l.r.dcin.a wwwt.fyd.n hscykcrccw yryf.rsrkt wkFsnslp.i
Consensus



                            501                                                550
msa209368.2{147_COH1}       lwGllvk.ma sv.Kilqvs. hltrvlk.li akvaivcwnti qvga.qlkeq
msa209368.2{147_M732}       lwGllvk.ma sv.Kilqvs. hltrvlk.li akvaivcwnri qvga.qlkeq
msa209368.2{147_M781}       lwGllvk.ma sv.Kilqvs. hltrvlk.li akvaivcwnn qvga.qlkeq
msa209368.2{147_18RS21}     pvGiiskvdg eriKntssql tfnqsfewd sqggnrmleq sswgvtaega
msa209368.2{147_2603}       pvGiiskvdg eriKntssql tfnqsfewd sqggnrmleq sswgvtaega
msa209368.2{147_JM9130013}  pvGiiskvdg eriKntssql tfnqsfewd sqggnrmleq sswgvtaega
msa209368.2{147_090}        pvGviskvdg eriKntssql tfnqsfewd sqggnrmleq sswgvtaega
msa209368.2{147_CJB110}     pvGviskvdg eriKntssql tfnqsfewd sqggnrmleq sswgvtaega
msa209368.2{147_1169NT}     pvGviskvdg eriKntssql tfnqrfewd sqggnrmleq sswgvtaega
msa209368.2{147_H36B}       pvGviskvdg eriKntssql tfnqsfewd sqggnrmleq sswgvtaega
msa209368.2{147_A909}       tcGgy..srw rayKkyfksv ni.pef.ss. .prwqsyagt iklgrds.rs
Consensus



                            551                                                600
msa209368.2{147_COH1}       sslm.qllal kfilqpiiin tkqclvqvwl hhmlqd..qc fkviwlrnik
msa209368.2{147_M732}       sslm.qllal kfilqpiiin tkqclvqvwl hhmlqd..qc fkviwlrnik
msa209368.2{147_M781}       sslm.qllal kfilqpiiin tkqclvqvwl hhmlqd..qc fkviwlrnik
msa209368.2{147_18RS21}     ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqshlaekyk
msa209368.2{147_2603}       ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqshlaekyk
msa209368.2{147_JM9130013}  ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqshlaekyk
msa209368.2{147_090}        ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqshlaekyk
msa209368.2{147_CJB110}     ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqnhlaekyk
msa209368.2{147_1169NT}     ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqshlaekyk
msa209368.2{147_H36B}       ikpdvtasgf eiysstynnq yqtmsgtsma sphvaglmtm lqshlaekyk
msa209368.2{147_A909}       nqa.cnsfwl .nlffnl..s ipnnvwykyg fttccrindn asksfg.ei.
Consensus



                            601                                                650
msa209368.2{147_COH1}       g.i.ilknc. nclktss.aq qqhyivkrir rfihhvskvq v.lmlkKlsk
msa209368.2{147_M732}       g.i.ilknc. nclktss.aq qqhyivkrir rfihhvskvq v.lmlkKlsk
msa209368.2{147_M781}       g.i.ilknc. nclktss.aq qqhyivkrir rfihhvskvq v.lmlkKlsk
msa209368.2{147_18RS21}     gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_2603}       gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_JM9130013}  gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_090}        gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_CJB110}     gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_1169NT}     gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_H36B}       gmnldskkll elsknilmss atalyseedk afysprqqga gvvdaeKaiq
msa209368.2{147_A909}       rdefrf.kia riv.khphel snsii..rg. gvlfttsarc res.c.Ksyp
Consensus                                                                     *



                            651                                                700
msa209368.2{147_COH1}       lnimlletma klklisnere inlisqlqfi nl.kvsknei iklm.qqnk.
msa209368.2{147_M732}       lnimlletma klklisnere inlisqlqfi nl.kvsknei iklm.qqnk.
msa209368.2{147_M781}       lnimlletma klklisnere inlisqlqfi nl.kvsknei iklm.qqnk.
msa209368.2{147_18RS21}     aqyyitgndg kakinlkrmg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_2603}       aqyyitgndg kakinlkrmg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_JM9130013}  aqyyitgndg kakinlkrmg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_090}        aqyyvtgndg kakinlkrvg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_CJB110}     aqyyvtgndg kakinlkrvg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_1169NT}     aqyyvtgndg kakinlkrvg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_H36B}       aqyyvtgndg kakinlkrvg dkfditvtih klvegvkely yqanvateqv
msa209368.2{147_A909}       ssilcywkrw qs.n.sqtsg r.i.yhsyns .tcrrcqriv lss.csnrts
Consensus



                            701                                                750
msa209368.2{147_COH1}       ikvnlplnhk pc.iligrk. ffvikkhkfd lllmlvnlvr n.Knrwqmvi
msa209368.2{147_M732}       ikvnlplnhk pc.iligrk. ffvikkhkfd lllmlvnlvr n.Knrwqmvi
msa209368.2{147_M781}       ikvnlplnhk pc.iligrk. ffvikkhkfd lllmlvnlvr n.Knrwqmvi
msa209368.2{147_18RS21}     nkgkfalkpq alldtnwqkv ilrdketqyr ftidasqfsq klKeqmangy
msa209368.2{147_2603}       nkgkfalkpq alldtnwqkv ilrdketqvr ftidasqfsq klKeqmangy
msa209368.2{147_JM9130013}  nkgkfalkpq alldtnwqkv ilrdketqvr ftidasqfsq klKeqmangy
msa209368.2{147_090}        nkgkfalkpq alldtnwqkv ilrdketqvr ftidasqfsq klKeqmangy
msa209368.2{147_CJB110}     nkgkfalkpq alldtnwqkv ilrdketqvr ftidasqfsq klKeqmangy
msa209368.2{147_1169NT}     nkgkfalkpq alldtnwqkv ilrdketqvr ftidasqfsq klKeqmangy
msa209368.2{147_H36B}       nkgkfalkpq alldtnwqkv ilrdketqvr ftidssqfsq klKeqmangy
msa209368.2{147_A909}       k.r.icp.tt slary.laes nss..rntsB iyy.f.si.s eiKrtdgkwl
Consensus



                            751                                                800
msa209368.2{147_COH1}       s.kvlyvlkk privirs..v fll.dlmvil rtykhlkhrf irrflkwst
msa209368.2{147_M732}       s.kvlyvlkk privirs..v fll.dlmvil rtykhlkhrf irrflkwst
msa209368.2{147_M781}       s.kvlyvlkk privirs..v fll.dlmvil rtykhlkhrf irrflkwst
msa209368.2{147_18RS21}     flegfvrfke akdsnqelms ipfvgfngdf anlqaletpi yktlskgsfy
msa209368.2{147_2603}       flegfvrfke akdsnqelms ipfvgfngdf anlqaletpi yktlskgsfy
msa209368.2{147_JM9130013}  flegfvrfke akdsnqelms ipfvgfngdf anlqaletpi yktlskgsfy
msa209368.2{147_090}        flegfvrfke akdsnqelms ipfvgfngdf anlqaletpi yktlskgsfy
msa209368.2{147_CJB110}     flegfvrfke akdsnqelms ipfvgfngdf anlqaletpi yktlskgsfy
msa209368.2{147_1169NT}     flegfvrfke akdsnqelms ipfvgfngdf aslqaletpi yktlskgsfy
msa209368.2{147_H36B}       flegfvrfke akdsnqelms ipfvgfngdf anlqaletpi yktlskgsfy
msa209368.2{147_A909}       flrrfctf.r sqg..sgvne ysfcri.w.f celtst.ntd l.daf.r.fl
Consensus






                            801                                                850
msa209368.2{147_COH1}       inqmiqlikt nwstmnqlll kattilpc.h nqrlgamlim skmvgs.n.h
msa209368.2{147_M732}       inqmiqlikt nwstmnqlll kattilpc.h nqrlgamlim skmvgs.n.h
msa209368.2{147_M781}       inqmiqlikt nwstmnqlll kattilpc.h nqrlgamlim skmvgs.n.h
msa209368.2{147_18RS21}     ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_2603}       ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_JM9130013}  ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_090}        ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_CJB110}     ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_1169NT}     ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_H36B}       ykpndtthkd qleynesapf esnnytallt qsaswgyvdy vknggelela
msa209368.2{147_A909}       l.tk.yns.r pigvq.issf .kqqlyclvn tisvlglc.l cqkwwgvris
Consensus



                            851                                                900
msa209368.2{147_COH1}       rrvqKelf.e llrirlrikq fifwkemqri ihilpflqik meigtkslpr
msa209368.2{147_M732}       rrvqKelf.e llrirlrikq fifwkemqri ihilpflqik meigtkslpr
msa209368.2{147_M781}       rrvqKelf.e llrirlrikq fifwkemqri ihilpflqik meigtkslpr
msa209368.2{147_18RS21}     pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_2603}       pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_JM9130013}  pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_090}        pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_CJB110}     pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_1169NT}     pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_H36B}       pespKriilg tfenkvedkt ihllerdaan npyfaispnk dgnrdeitpq
msa209368.2{147_A909}       tgesKknyfr nf.e.g.g.n nssfgkrcse .sifchfsk. rwk.g.nhsp
Consensus



                            901                                                950
msa209368.2{147_COH1}       qls.emlrif llkf.ikmem lfgkvrfyhl ivkisiiiqs kvmviivwml
msa209368.2{147_M732}       qls.emlrif llkf.ikmem lfgkvrfyhl ivkisiiiqs kvmviivwml
msa209368.2{147_M781}       qls.emlrif llkf.ikmem lfgkvrfyhl ivkisiiiqs kvmviivwml
msa209368.2{147_18RS21}     atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_2603}       atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_JM9130013}  atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_090}        atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_CJB110}     atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_1169NT}     atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_H36B}       atflrnvkdi saqvldqngn viwqskvlps yrknfhnnpk qsdghyrmda
msa209368.2{147_A909}       gnflkkc.gy fcsssrskwk cylak.gfti ls.kfp..sk ak.wslsygc
Consensus



                            951                                               1000
msa209368.2{147_COH1}       fsgw.irma kl.qmvFili ayvthq.qke qivrsqtlkf k.vlshqifl
msa209368.2{147_M732}       fsgw.irma kl.qmvFili ayvthq.qke qivrsqtlkf k.vlshqifl
msa209368.2{147_M781}       fsgw.irma kl.qmvFili ayvthq.qke qivrsqtlkf k.vlshqifl
msa209368.2{147_18RS21}     lqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_2603}       lqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_JM9130013}  lqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_090}        fqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_CJB110}     fqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_1169NT}     lqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_H36B}       lqwsgldkdg kvvadgFyty rlrytpvaeg ansqesdfkv qvstkspnlp
msa209368.2{147_A909}       psvewfr.gw qscsrwFlyl sftlhtssrr sk.sgvrl.s ssky.vtkss
Consensus



                            1001                                              1050
msa209368.2{147_COH1}       helslmklie h.a.pclrkv vmflhivyn. fylml.Kmkn mgmrlltiis
msa209368.2{147_M732}       helslmklie h.a.pclrkv vmflhivyn. fylml.Kmkn mgmrlltiis
msa209368.2{147_M781}       helslmklie h.a.pclrkv vmflhivyn. fylml.Kmkn mgmrlltiis
msa209368.2{147_18RS21}     sraqfdetnr tlslampkes syvptyrlql vlshvvKdee ygdetsyhyf
msa209368.2{147_2603}       sraqfdetnr tlslampkes syvptyrlql vlshvvKdee ygdetsyhyf
msa209368.2{147_JM9130013}  sraqfdetnr tlslampkes syvptyrlql vlshvvKdee ygdetsyhyf
msa209368.2{147_090}        llaqfdetnr tlslampkes syvptyrlql vlshvvKdee ygdetsyhyf
msa209368.2{147_CJB110}     llaqfdetnr tlslampkes syvptyrlql vlshvvKdee ygdetsyhyf
msa209368.2{147_1169NT}     sraqfdetnr tlslampkgs syvpiyrlql vlshvvKdee ygdetsyyyf
msa209368.2{147_H36B}       sraqfdetnr tlslampkes syvptyrlql vlshvvKdee ygdetsyhyf
msa209368.2{147_A909}       ftssv..n.s niklsha.gk .lcsyissti sfisccKr.r iwr.dflplf
Consensus



                            1051                                              1100
msa209368.2{147_COH1}       i.ikkvk.hf lkrlr.ervr lr.tlrp.hi lwkiklvilq r.nclts.ir
msa209368.2{147_M732}       i.ikkvk.hf lkrlr.ervr lr.tlrp.hi lwkiklvilq r.nclts.ir
msa209368.2{147_M781}       i.ikkvk.hf lkrlr.ervr lr.tlrp.hi lwkiklvilq r.nclts.ir
msa209368.2{147_18RS21}     hidqegkvtl pktvkigese vavdpkaltl vvedkagnfa tvklsdllnk
msa209368.2{147_2603}       hidqegkvtl pktvkigese vavdpkaltl vvedkagnfa tvklsdllnk
msa209368.2{147_JM9130013}  hidqegkvtl pktvkigese vavdpkaltl vvedkagnfa tvklsdllnk
msa209368.2{147_090}        hidqegkvtl pktvkigese vavdpkaltl vvedkagnfa tvklsdllnk
msa209368.2{147_CJB110}     hidqegkvtl pktvkigese vavdpkaltl vvedkagnfa tvklsdllnk
msa209368.2{147_1169NT}     hidqegkatl pktvkigese vavdpkaltl vvedkagnfa tvklsdllnk
msa209368.2{147_H36B}       hidqegkvtl pktvkigese vavdpktltl vvedkagnfa tvklsdllnk
msa209368.2{147_A909}       pyrsrr.sdt s.ns.drre. gcsrp.dldt ccgr.sw.fr ngkiv.ple.
Consensus



                            1101                                              1150
msa209368.2{147_COH1}       q.yqrkktl. .fltvsnili t.rKnlclfl kkeK..tri. kk.h.lslkl
msa209368.2{147_M732}       q.yqrkktl. .fltvsnili t.rKnlclfl kkeK..tri. kk.h.lslkl
msa209368.2{147_M781}       q.yqrkktl. .fltvsnili t.rKnlclfl kkeK..tri. kk.h.lslkl
msa209368.2{147_18RS21}     awsekenai visnsfkyfd nlkKepmfis kkeKwnknl eeiilvkpqt
msa209368.2{147_2603}       awsekenai visnsfkyfd nlkKepmfis kkeKwnknl eeiilvkpqt
msa209368.2{147_JM9130013}  awsekenai visnsfkyfd nlkKepmfis kkeKwnknl eeiilvkpqt
msa209368.2{147_090}        awsekenai visnsfkyfd nlkKesmfis kegKwnknl eeitlvkpqt
msa209368.2{147_CJB110}     awsekenai visnsfkyfd nlkKesmfis kegKwnknl eeitlvkpqt
msa209368.2{147_1169NT}     awsekenai visnsfkyfd nlkKepmfis kkeKwnknl eeiilvkpht
msa209368.2{147_H36B}       awsekenai visnnfkyfd nlkKepmfis kegKwnknl eeialvkpqt
msa209368.2{147_A909}       gssirerkry snf.qfqif. .leKrtyvyf .rrKsskqes rrnsis.aan
Consensus                   _ „ * *_

                            1151                                              1200
msa209368.2{147_COH1}       qlllnhclkk .lnqemrkss llqtiivae. lrsyhlnitg illti
msa209368.2{147_M732}       qlllnhclkk .lnqemrkss llqtiivae. lrsyhlnitg illti
msa209368.2{147_M781}       qlllnhclkk .lnqemrkss llqtiivae. lrsyhlnitg illti
msa209368.2{147_18RS21}     tvttqslske itksgnekvl tstnnnssrv akiispkhng dsvnhT
msa209368.2{147_2603}       tvttqslske itksgnekvl tstnnnssrv akiispkhng dsvnhTlpst
msa209368.2{147_JM9130013}  tvttqslske itksgnekvl tstnnnssrv akiispkhng dsvnhT
msa209368.2{147_090}        tvttqslske itksgnekvl tstnnnssrv akiispkhng dsvnhT
msa209368.2{147_CJB110}     tvttqslske itksgnekvl tstnnnssrv akiispkhng dsvnhT
msa209368.2{147_1169NT}     tvttqslske itksgnekvl tstnnnssrv akiispkhng dsvnhT
msa209368.2{147_H36B}       tvttqslske itqsgnekvl tstnnnssrv akiispkhng dsvnhT
msa209368.2{147_A909}       ysyysiiv.r nnsirk.esp hfykq...qs s.dhit.t.r gfc.py
Consensus                   __: _ It****

                            1201                             1233
msa209368.2{147_COH1}
msa209368.2{147_M732}
msa209368.2{147_M781}
msa209368.2{147_18RS21}
msa209368.2{147_2603}       sdratnglfv gtlallssll lylkpkktkn nsk
msa209368.2{147_JM9130013}
msa209368.2{147_090}
msa209368.2{147_CJB110}
msa209368.2{147_1169NT}
msa209368.2{147_H36B}
msa209368.2{147_A909}
Consensus                   ********** ********** ********** ***
Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD)



SEQ ID NO. 4501 
STRAIN 2603 

ATGAAAAAGATTAGAAAAAGTTTAGGACTT 

TTAGCGTTTTTTTCGGTAGCCAGTGTAAATGCTC 

CAGATAGGACTTCAGCCAAATACTACAGAGG 

ACTGACAACTTAAAAGTTG ATTTATTGAG C CAAATGACAG AT AG CGAATT GAACCAG AAG 
TATAAG AGTATCTTGACTT CTC CT ACTGAT ACTAATGGT CAG ACAAAGAT AG CACTCCCA 
AATGGTTCGTACTTTGGTCGTGCirATAAAGCTGATClAAAG 

TTTC ATATTG AATT ACCAGATGAT AAGTT ATCAAATCAATTACAGATAT^AT C CTAAG CG A 
AAAGTTGAAACAGG C CGATT AAAACTTATTAAATATACAAAAGAAGG AAAGATAAAG AAA 
AGGCTATCCGGAGTAATATTTGTATTATACGATAACCAGAATCAGCCAGTTCGCrTTAAA 
AATGGACGATTTACGACCGATCAAGATGGGATTACTTCATTAGTAACTGATGATAAGGGA 
GAAATTGAGGTTGAAGGTTTATTACCTGGT^ 

ACTGGTTACCGTATATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAG 
GAAGTAGAGGTAGAAAACGAAAAAGAA^CTCCTCCACCAACAAATCCTAAACCATCACAA 
CCG CTTTTT C CACAATCATTTCTT CCT AAAACAGG AATG ATT ATTGGTGGAGGACTGACA 
ATTCITGGTTGTATTATTTTGGGAATTTTGTTTATCTTTTTAAGAA 
AAAT CTG AAAGAAACG ATACAGT A 

SEQ ID NO. 4502 
STRAIN 090 

GATACCCCTAATCAACTAACAATCACAC 

AGATAGGACTT CAG C CAAAT ACTACAGAGG AGGGG ATTT CTT AT CGTTT A 
TGGACTGTGACTGACAACTTAAAAGTTGATTTATTGAGC CAAATGACAG A 
TAGCGAATTGAACCAGAAGTATAAGAGTATCTTGACTTCTCCTACTGATA 
CTAATGGtCAGACAAAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGT 
GCTT ATAAAG CTGAT CAAAG CGTTT CAACAATAGTACCTTTTTATATTGA 
ATTACCAGATGATAAGTTATCAAATa^TTACAGATAAATCCTAAGCGAA 
AAGTTGAAACAGG C CGATTAAAACITATTAAATAT ACAAAAG AAG GAAAG 
ATAAAGAAAAGGCTATCAGGAGTAATATTTGTATTATACGATAACCAGAA 
TCAGCCAGTTCG CITTAAA^TGGACGATTTACGACCGAT CAAGATGGGA 
TTACTTCATT^GTAACTGATGATAA^ 

TTAC CTGGTAAGTATATTTTT CGAGAAG CAAAAGCACTAACTGG t TACCG 
TATATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAGG 
AAGTaGAGGTaGAAAACGAAAAAGAAACTCCTCa^CCAACAAATCCTAAA 
CCATCACAACCG 

SEQ ID NO. 4503 
STRAIN H36B 

GATACCCCTAATCAACTAACAATCACACAGA 

TAGGACTTCAG CCAAATACT ACAGAGGAGGGGATTT CTTATCGTTTATGG 
ACTGTGACTG ACAACTTAAAAGTTGATTTATTGAG C CAAATGACAGATAG 
CGAATTG AACCAGAAGTATAAGAGTAT CTTGACTT CTCCTACTGATACT 
ATGG t CAGAQ\AAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGTGCT 
TATAAAG CTGATCAAAG CGTTT CAACAATAGTAC CTTTTTATATTGAATT 
ACCAGATGATAAGTTATCAAAT CAATTACAGATAAATC CTAAGCGAAAAG 
TTGAAACAGG CCGATTAAAACTTATTAAATATACAAAAGAAGGAAAGATA 
AAGAAAAGGCTv^CCGGAGTAATATTTGTATTATACGATAACCAGAATCA 
GCCAGTTCGCTTTAAAAATGGACGATTTACGACCGAT CAAGATGGGATTA 
CITC^TTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTATTA 
CCTGGTAAGTATATTTTTCGAGAAGCAAAAG 

AT CTATGAAGGATG CTGTAGTTGCTGTAGTTG CTAATAAAACACAGGAAG 
TAGAGGTAGAAAACGA^A^GAAACTCCTCCACCAACAAATCCTA^ACCA 
TCACAACCGC 

SEQ ID NO. 4504 
STRAIN 18RS21 

GATACCCCTAATCAACTAACAATCACACAG

ATAGGACTTCAGCCAAATACTACAGAGGAGGGGATTTCTTATCGTTTATG
GACTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCAAATGACAGATA
GCGAATTGAACCAGAAGTATAAGAGTATCTTGACTTCTCCTACTGATACT
AATGGtCAGACAAAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGTGC
TTATAAAGCTGATCAAAGCGTTTCAACAATAGTACCTTTTTATATTGAAT
TACCAGATGATAAGTTATCAAATCAATTACAGATAAATCCTAAGCGAAAA
GTTGAAACAGGCCGATTAAAACTTATTAAATATACAAAAGAAGGAAAGAT
AAAGAAAAGGCTATCCGGAGTAATATTTGTATTATACGATAACCAGAATC
AGCCAGTTCGCTTTAAAAATGGACGATTTACGACCGATCAAGATGGGATT
ACTTCATTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTATT
ACCTGGTAAGTATATTTTTCGAGAAGCAAAAGCACTAACTGGTTACCGTA
TATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAGGAA
GTAGAGGTAGAAAACGAAAAAGAAACTCCTCCACCAACAAATCCTAAACC
ATCACAACC

SEQ ID NO. 4505 
STRAIN CJB110 

GATACCCCTAATCAACTAACAATCACACA
GATAGGACTTCAGCCAAATACTACAGAGGAGGGGATTTCTTATCGTTTAT
GGaCTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCAAATGACAGAT
AGCGAATTgAACCAGAAGTATAAGAGTATCTTGACTTCTCctACTGATAc
TAATGGTCAGACAAAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGTG
CTTATAAAGCTGATCAAAGCGTTTCAACAATAGTACCTTTTTATATTGAA
TTACCAGATGATAAGTTATCAAATCAATTACAGatAAATCCTAAGCGAAA
AGTTGAAACAGGCCGATTaaAACTTATTAAATATACAAAAGAAGGAAAGA
Table 45: Comparative Sequences relating to SAG1404 (strain info highlighted in BOLD)

* 

TAAAGAAAAGGCTaTCAGGAGTAATATTTGTATTATACGATAACCAGAAT 
CAGCCAGTTCGCTTTAAAAATGGACGATTTACGACCGATCAAGATGGGAT 
TACTTCATTAGTAACTGATGATAAGGGAGAAATTGAGGTTGAAGGTTTAT
TACCTGGTAAGTATATTTTTCGAGAAGCAAAAGCACTAACTGGTTaCCGT
ATATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAGGA
AGTAGAGGTAGAAAACGAAAAAGAAACTCCTCCACCAACAAATCCTAAAC 
CATCACAACC 

SEQ ID NO. 4506 
STRAIN 1169NT 

GATACCCCTAATCAACTAACAATCACACAG

ATAGGACTTCAGCCAAATACTACAGAGGAGGGGATTTCTTATCGTTTATG
GACTGTGACTGACAACTTAAAAGTTGATTTATTGAGCCAAATGACAGATA
GCGAATTGAACCAGAAGTATAAGAGTATCTTGACTTCTCCTACTGATACT
AATGGtCAgaCAAAGATAGCACTCCCAAATGGTTCGTACTTTGGTCGTGC
TTATAAAGCTGATCAAAGCGTTTCAACAATAGTACCTTTTTATATTGAAT
TACCAGATGATAAGTTATCAAATCAATTACAGATAAATCCTAAGCGAAAA
GTTGAAACAGGCCGATTAAAACTTATTAAATATACAAAAGAAGGAAAGAT
AAAGAAAAGGCTATCAGGAGTAATATTTGTATTATACGATAACCAGAATC
AGCCAGTTCGCTTTAAAAATGGACGATTTACGACCGATCAAGATGGGATT
ACTTCATTAGTAACtgaTGATAAGGGAGAAATTGAGGTTGAAGGTTTATT
ACCTGGTAAGTATATTTTTCGAGAAGCAAAAGCACTAACTGGTTACCGTA
TATCTATGAAGGATGCTGTAGTTGCTGTAGTTGCTAATAAAACACAGGAA
GTAGAGGTAGAAAACGAAAAAGAAACTCCTCCACCAACAAATCCTAAACC
ATCACAACC

PRETTY of: /biotmp/msa184750.2{*}  May 13, 2003 06:23

                         1                                                   50
msa184750.2{150_090}
msa184750.2{150_1169NT}
msa184750.2{150_CJB110}
msa184750.2{150_18RS21}
msa184750.2{150_2603}    atgaaaaaga ttagaaaaag tttaggactt ctactatgtt gctttttagg
msa184750.2{150_H36B}
Consensus                ********** ********** ********** ********** **********

                         51                                                 100
msa184750.2{150_090}                                                    GATACCC
msa184750.2{150_1169NT}                                                 GATACCC
msa184750.2{150_CJB110}                                                 GATACCC
msa184750.2{150_18RS21}                                                 GATACCC
msa184750.2{150_2603}    attggtacaa ctagcgtttt tttcggtagc cagtgtaaat gctGATACCC
msa184750.2{150_H36B}                                                   GATACCC
Consensus                ********** ********** ********** ********** **********



                         101                                                150
msa184750.2{150_090}     CTAATCAACT AACAATCACA CAGATAGGAC TTCAGCCAAA TACTACAGAG
msa184750.2{150_1169NT}  CTAATCAACT AACAATCACA CAGATAGGAC TTCAGCCAAA TACTACAGAG
msa184750.2{150_CJB110}  CTAATCAACT AACAATCACA CAGATAGGAC TTCAGCCAAA TACTACAGAG
msa184750.2{150_18RS21}  CTAATCAACT AACAATCACA CAGATAGGAC TTCAGCCAAA TACTACAGAG
msa184750.2{150_2603}    CTAATCAACT AACAATCACA CAGATAGGAC TTCAGCCAAA TACTACAGAG
msa184750.2{150_H36B}    CTAATCAACT AACAATCACA CAGATAGGAC TTCAGCCAAA TACTACAGAG
Consensus                ********** ********** ********** ********** **********



                         151                                                200
msa184750.2{150_090}     GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA
msa184750.2{150_1169NT}  GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA
msa184750.2{150_CJB110}  GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA
msa184750.2{150_18RS21}  GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA
msa184750.2{150_2603}    GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA
msa184750.2{150_H36B}    GAGGGGATTT CTTATCGTTT ATGGACTGTG ACTGACAACT TAAAAGTTGA
Consensus                ********** ********** ********** ********** **********



                         201                                                250
msa184750.2{150_090}     TTTATTGAGC CAAATGACAG ATAGCGAATT GAACCAGAAG TATAAGAGTA
msa184750.2{150_1169NT}  TTTATTGAGC CAAATGACAG ATAGCGAATT GAACCAGAAG TATAAGAGTA
msa184750.2{150_CJB110}  TTTATTGAGC CAAATGACAG ATAGCGAATT GAACCAGAAG TATAAGAGTA
msa184750.2{150_18RS21}  TTTATTGAGC CAAATGACAG ATAGCGAATT GAACCAGAAG TATAAGAGTA
msa184750.2{150_2603}    TTTATTGAGC CAAATGACAG ATAGCGAATT GAACCAGAAG TATAAGAGTA
msa184750.2{150_H36B}    TTTATTGAGC CAAATGACAG ATAGCGAATT GAACCAGAAG TATAAGAGTA
Consensus                ********** ********** ********** ********** **********



                         251                                                300
msa184750.2{150_090}     TCTTGACTTC TCCTACTGAT ACTAATGGTC AGACAAAGAT AGCACTCCCA
msa184750.2{150_1169NT}  TCTTGACTTC TCCTACTGAT ACTAATGGTC AGACAAAGAT AGCACTCCCA
msa184750.2{150_CJB110}  TCTTGACTTC TCCTACTGAT ACTAATGGTC AGACAAAGAT AGCACTCCCA
msa184750.2{150_18RS21}  TCTTGACTTC TCCTACTGAT ACTAATGGTC AGACAAAGAT AGCACTCCCA
msa184750.2{150_2603}    TCTTGACTTC TCCTACTGAT ACTAATGGTC AGACAAAGAT AGCACTCCCA
msa184750.2{150_H36B}    TCTTGACTTC TCCTACTGAT ACTAATGGTC AGACAAAGAT AGCACTCCCA
Consensus                ********** ********** ********** ********** **********



                         301                                                350
msa184750.2{150_090}     AATGGTTCGT ACTTTGGTCG TGCTTATAAA GCTGATCAAA GCGTTTCAAC
msa184750.2{150_1169NT}  AATGGTTCGT ACTTTGGTCG TGCTTATAAA GCTGATCAAA GCGTTTCAAC
msa184750.2{150_CJB110}  AATGGTTCGT ACTTTGGTCG TGCTTATAAA GCTGATCAAA GCGTTTCAAC
msa184750.2{150_18RS21}  AATGGTTCGT ACTTTGGTCG TGCTTATAAA GCTGATCAAA GCGTTTCAAC
msa184750.2{150_2603}    AATGGTTCGT ACTTTGGTCG TGCTTATAAA GCTGATCAAA GCGTTTCAAC
msa184750.2{150_H36B}    AATGGTTCGT ACTTTGGTCG TGCTTATAAA GCTGATCAAA GCGTTTCAAC
Consensus                ********** ********** ********** ********** **********



                         351                                                400
msa184750.2{150_090}     AATAGTACCT TTTTATATTG AATTACCAGA TGATAAGTTA TCAAATCAAT
msa184750.2{150_1169NT}  AATAGTACCT TTTTATATTG AATTACCAGA TGATAAGTTA TCAAATCAAT
msa184750.2{150_CJB110}  AATAGTACCT TTTTATATTG AATTACCAGA TGATAAGTTA TCAAATCAAT
msa184750.2{150_18RS21}  AATAGTACCT TTTTATATTG AATTACCAGA TGATAAGTTA TCAAATCAAT
msa184750.2{150_2603}    AATAGTACCT TTTTATATTG AATTACCAGA TGATAAGTTA TCAAATCAAT
msa184750.2{150_H36B}    AATAGTACCT TTTTATATTG AATTACCAGA TGATAAGTTA TCAAATCAAT
Consensus                ********** ********** ********** ********** **********



                         401                                                450
msa184750.2{150_090}     TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT
msa184750.2{150_1169NT}  TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT
msa184750.2{150_CJB110}  TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT
msa184750.2{150_18RS21}  TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT
msa184750.2{150_2603}    TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT
msa184750.2{150_H36B}    TACAGATAAA TCCTAAGCGA AAAGTTGAAA CAGGCCGATT AAAACTTATT
Consensus                ********** ********** ********** ********** **********



                         451                                                500
msa184750.2{150_090}     AAATATACAA AAGAAGGAAA GATAAAGAAA AGGCTaTCaG GAGTAATATT
msa184750.2{150_1169NT}  AAATATACAA AAGAAGGAAA GATAAAGAAA AGGCTaTCaG GAGTAATATT
msa184750.2{150_CJB110}  AAATATACAA AAGAAGGAAA GATAAAGAAA AGGCTaTCaG GAGTAATATT
msa184750.2{150_18RS21}  AAATATACAA AAGAAGGAAA GATAAAGAAA AGGCTaTCcG GAGTAATATT
msa184750.2{150_2603}    AAATATACAA AAGAAGGAAA GATAAAGAAA AGGCTaTCcG GAGTAATATT
msa184750.2{150_H36B}    AAATATACAA AAGAAGGAAA GATAAAGAAA AGGCTwTCcG GAGTAATATT
Consensus                ********** ********** ********** *****-**-* **********



                         501                                                550
msa184750.2{150_090}     TGTATTATAC GATAACCAGA ATCAGCCAGT TCGCTTTAAA AATGGACGAT
msa184750.2{150_1169NT}  TGTATTATAC GATAACCAGA ATCAGCCAGT TCGCTTTAAA AATGGACGAT
msa184750.2{150_CJB110}  TGTATTATAC GATAACCAGA ATCAGCCAGT TCGCTTTAAA AATGGACGAT
msa184750.2{150_18RS21}  TGTATTATAC GATAACCAGA ATCAGCCAGT TCGCTTTAAA AATGGACGAT
msa184750.2{150_2603}    TGTATTATAC GATAACCAGA ATCAGCCAGT TCGCTTTAAA AATGGACGAT
msa184750.2{150_H36B}    TGTATTATAC GATAACCAGA ATCAGCCAGT TCGCTTTAAA AATGGACGAT
Consensus                ********** ********** ********** ********** **********



                         551                                                600
msa184750.2{150_090}     TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA
msa184750.2{150_1169NT}  TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA
msa184750.2{150_CJB110}  TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA
msa184750.2{150_18RS21}  TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA
msa184750.2{150_2603}    TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA
msa184750.2{150_H36B}    TTACGACCGA TCAAGATGGG ATTACTTCAT TAGTAACTGA TGATAAGGGA
Consensus                ********** ********** ********** ********** **********



                         601                                                650
msa184750.2{150_090}     GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC
msa184750.2{150_1169NT}  GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC
msa184750.2{150_CJB110}  GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC
msa184750.2{150_18RS21}  GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC
msa184750.2{150_2603}    GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC
msa184750.2{150_H36B}    GAAATTGAGG TTGAAGGTTT ATTACCTGGT AAGTATATTT TTCGAGAAGC
Consensus                ********** ********** ********** ********** **********



                         651                                                700
msa184750.2{150_090}     AAAAGCACTA ACTGGTTACC GTATATCTAT GAAGGATGCT GTAGTTGCTG
msa184750.2{150_1169NT}  AAAAGCACTA ACTGGTTACC GTATATCTAT GAAGGATGCT GTAGTTGCTG
msa184750.2{150_CJB110}  AAAAGCACTA ACTGGTTACC GTATATCTAT GAAGGATGCT GTAGTTGCTG
msa184750.2{150_18RS21}  AAAAGCACTA ACTGGTTACC GTATATCTAT GAAGGATGCT GTAGTTGCTG
msa184750.2{150_2603}    AAAAGCACTA ACTGGTTACC GTATATCTAT GAAGGATGCT GTAGTTGCTG
msa184750.2{150_H36B}    AAAAGCACTA ACTGGTTACC GTATATCTAT GAAGGATGCT GTAGTTGCTG
Consensus                ********** ********** ********** ********** **********



                         701                                                750
msa184750.2{150_090}     TAGTTGCTAA TAAAACACAG GAAGTAGAGG TAGAAAACGA AAAAGAAACT
msa184750.2{150_1169NT}  TAGTTGCTAA TAAAACACAG GAAGTAGAGG TAGAAAACGA AAAAGAAACT
msa184750.2{150_CJB110}  TAGTTGCTAA TAAAACACAG GAAGTAGAGG TAGAAAACGA AAAAGAAACT
msa184750.2{150_18RS21}  TAGTTGCTAA TAAAACACAG GAAGTAGAGG TAGAAAACGA AAAAGAAACT
msa184750.2{150_2603}    TAGTTGCTAA TAAAACACAG GAAGTAGAGG TAGAAAACGA AAAAGAAACT
msa184750.2{150_H36B}    TAGTTGCTAA TAAAACACAG GAAGTAGAGG TAGAAAACGA AAAAGAAACT
Consensus                ********** ********** ********** ********** **********



                         751                                                800
msa184750.2{150_090}     CCTCCACCAA CAAATCCTAA ACCATCACAA CCg-
msa184750.2{150_1169NT}  CCTCCACCAA CAAATCCTAA ACCATCACAA CC--
msa184750.2{150_CJB110}  CCTCCACCAA CAAATCCTAA ACCATCACAA CC
msa184750.2{150_18RS21}  CCTCCACCAA CAAATCCTAA ACCATCACAA CC
msa184750.2{150_2603}    CCTCCACCAA CAAATCCTAA ACCATCACAA CCgCtttttc cacaatcatt
msa184750.2{150_H36B}    CCTCCACCAA CAAATCCTAA ACCATCACAA CCgC
Consensus                ********** ********** ********** **-******* **********

                         801                                                850
msa184750.2{150_090}
msa184750.2{150_1169NT}
msa184750.2{150_CJB110}
msa184750.2{150_18RS21}
msa184750.2{150_2603}    tcttcctaaa acaggaatga ttattggtgg aggactgaca attcttggtt
msa184750.2{150_H36B}
Consensus                ********** ********** ********** ********** **********

                         851                                                900
msa184750.2{150_090}
msa184750.2{150_1169NT}
msa184750.2{150_CJB110}
msa184750.2{150_18RS21}
msa184750.2{150_2603}    gtattatttt gggaattttg tttatctttt taagaaaaac taaaaatagc
msa184750.2{150_H36B}
Consensus                ********** ********** ********** ********** **********

                         901                    924
msa184750.2{150_090}
msa184750.2{150_1169NT}
msa184750.2{150_CJB110}
msa184750.2{150_18RS21}
msa184750.2{150_2603}    aaatctgaaa gaaacgatac agta
msa184750.2{150_H36B}
Consensus                ********** ********** ****

WO 2004/018646 PCT/US2003/026827

SEQ ID NO. 4507
STRAIN 2603

MKKIRKSLGLLLCCFLGLVQLAFFSVASVNADTPNQLTITQIGLQPNTTEEGISYRLWTV
TDNLKVDLLSQMTDSELNQKYKSILTSPTDTNGQTKIALPNGSYFGRAYKADQSVSTIVP
FYIELPDDKLSNQLQINPKRKVETGRLKLIKYTKEGKIKKRLSGVIFVLYDNQNQPVRFK
NGRFTTDQDGITSLVTDDKGEIEVEGLLPGKYIFREAKALTGYRISMKDAVVAVVANKTQ
EVEVENEKETPPPTNPKPSQPLFPQSFLPKTGMIIGGGLTILGCIILGILFIFLRKTKNS
KSERNDTV

SEQ ID NO. 4508
STRAIN 090

DTPNQLTITQIGLQPNTTEEGISYRLWTVTDNLKVDLLSQMTDSELNQKYKSILTSPTDT
NGQTKIALPNGSYFGRAYKADQSVSTIVPFYIELPDDKLSNQLQINPKRKVETGRLKLIK
YTKEGKIKKRLSGVIFVLYDNQNQPVRFKNGRFTTDQDGITSLVTDDKGEIEVEGLLPGK
YIFREAKALTGYRISMKDAVVAVVANKTQEVEVENEKETPPPTNPKPSQP

SEQ ID NO. 4509
STRAIN H36B

DTPNQLTITQIGLQPNTTEEGISYRLWTVTDNLKVDLLSQMTDSELNQKYKSILTSPTDT
NGQTKIALPNGSYFGRAYKADQSVSTIVPFYIELPDDKLSNQLQINPKRKVETGRLKLIK
YTKEGKIKKRLSGVIFVLYDNQNQPVRFKNGRFTTDQDGITSLVTDDKGEIEVEGLLPGK
YIFREAKALTGYRISMKDAVVAVVANKTQEVEVENEKETPPPTNPKPSQP

SEQ ID NO. 4510
STRAIN 18RS21

DTPNQLTITQIGLQPNTTEEGISYRLWTVTDNLKVDLLSQMTDSELNQKYKSILTSPTDT
NGQTKIALPNGSYFGRAYKADQSVSTIVPFYIELPDDKLSNQLQINPKRKVETGRLKLIK
YTKEGKIKKRLSGVIFVLYDNQNQPVRFKNGRFTTDQDGITSLVTDDKGEIEVEGLLPGK
YIFREAKALTGYRISMKDAVVAVVANKTQEVEVENEKETPPPTNPKPSQ

SEQ ID NO. 4511
STRAIN 1169NT

DTPNQLTITQIGLQPNTTEEGISYRLWTVTDNLKVDLLSQMTDSELNQKYKSILTSPTDT
NGQTKIALPNGSYFGRAYKADQSVSTIVPFYIELPDDKLSNQLQINPKRKVETGRLKLIK
YTKEGKIKKRLSGVIFVLYDNQNQPVRFKNGRFTTDQDGITSLVTDDKGEIEVEGLLPGK
YIFREAKALTGYRISMKDAVVAVVANKTQEVEVENEKETPPPTNPKPSQ



PRETTY of: /biotmp/msa184868.2{*}  May 13, 2003 06:25 ..

                         1                                                   50
msa184868.2{150_090}                                      -DTPNQLTIT QIGLQPNTTE
msa184868.2{150_2603}    mkkirkslgl llccflglvq laffsvasvn aDTPNQLTIT QIGLQPNTTE
msa184868.2{150_H36B}                                      DTPNQLTIT QIGLQPNTTE
msa184868.2{150_1169NT}                                    DTPNQLTIT QIGLQPNTTE
msa184868.2{150_18RS21}                                    DTPNQLTIT QIGLQPNTTE
Consensus                ********** ********** ********** ********** **********

                         51                                                 100
msa184868.2{150_090}     EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP
msa184868.2{150_2603}    EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP
msa184868.2{150_H36B}    EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP
msa184868.2{150_1169NT}  EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP
msa184868.2{150_18RS21}  EGISYRLWTV TDNLKVDLLS QMTDSELNQK YKSILTSPTD TNGQTKIALP
Consensus                ********** ********** ********** ********** **********



                         101                                                150
msa184868.2{150_090}     NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR KVETGRLKLI
msa184868.2{150_2603}    NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR KVETGRLKLI
msa184868.2{150_H36B}    NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR KVETGRLKLI
msa184868.2{150_1169NT}  NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR KVETGRLKLI
msa184868.2{150_18RS21}  NGSYFGRAYK ADQSVSTIVP FYIELPDDKL SNQLQINPKR KVETGRLKLI
Consensus                ********** ********** ********** ********** **********



                         151                                                200
msa184868.2{150_090}     KYTKEGKIKK RLSGVIFVLY DNQNQPVRFK NGRFTTDQDG ITSLVTDDKG
msa184868.2{150_2603}    KYTKEGKIKK RLSGVIFVLY DNQNQPVRFK NGRFTTDQDG ITSLVTDDKG
msa184868.2{150_H36B}    KYTKEGKIKK RLSGVIFVLY DNQNQPVRFK NGRFTTDQDG ITSLVTDDKG
msa184868.2{150_1169NT}  KYTKEGKIKK RLSGVIFVLY DNQNQPVRFK NGRFTTDQDG ITSLVTDDKG
msa184868.2{150_18RS21}  KYTKEGKIKK RLSGVIFVLY DNQNQPVRFK NGRFTTDQDG ITSLVTDDKG
Consensus                ********** ********** ********** ********** **********



                         201                                                250
msa184868.2{150_090}     EIEVEGLLPG KYIFREAKAL TGYRISMKDA VVAVVANKTQ EVEVENEKET
msa184868.2{150_2603}    EIEVEGLLPG KYIFREAKAL TGYRISMKDA VVAVVANKTQ EVEVENEKET
msa184868.2{150_H36B}    EIEVEGLLPG KYIFREAKAL TGYRISMKDA VVAVVANKTQ EVEVENEKET
msa184868.2{150_1169NT}  EIEVEGLLPG KYIFREAKAL TGYRISMKDA VVAVVANKTQ EVEVENEKET
msa184868.2{150_18RS21}  EIEVEGLLPG KYIFREAKAL TGYRISMKDA VVAVVANKTQ EVEVENEKET
Consensus                ********** ********** ********** ********** **********

                         251                                                300
msa184868.2{150_090}     PPPTNPKPSQ p
msa184868.2{150_2603}    PPPTNPKPSQ plfpqsflpk tgmiiggglt ilgciilgil fiflrktkns
msa184868.2{150_H36B}    PPPTNPKPSQ p
msa184868.2{150_1169NT}  PPPTNPKPSQ
msa184868.2{150_18RS21}  PPPTNPKPSQ
Consensus                ********** -********* ********** ********** **********



                         301
msa184868.2{150_090}
msa184868.2{150_2603}    kserndtv
msa184868.2{150_H36B}
msa184868.2{150_1169NT}
msa184868.2{150_18RS21}
Consensus                ********
Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD)



SEQ ID NO. 4601 
STRAIN A909 

TG ACAAATATTATTTTAC C CAACG TGGTTTAGAG CAAG CAGGTGTAACTATACTAC CTTT 
CTCAC CGAATAATAT CAGTGAGGATTTAG AGATTATTG CAGG AAATG CTTTT CGT CCAGA 
TAACAATGAAGAGTTGGCTTATGTTATTGAAAA.GGGCTATCATTTTAAACGATATCATGA 
ATTT CTCGG AGATTTT ATGCGT CAGTT CACTAGT CTAGGTGTAG CTG GGG CACATGGAAA 
AACCTCAACGACAGGTTT ATT AGCT CATGTTTT AAAAAATATT ACAG ACACTT CTTT C CT 
AACTGGAGATGGTACAGGACGTC^TTCTGCTAATGCTAACTACCT 

TGAATACGAACGTCATTTTATGCCGTACCATCCAGAATACTCA^TTATTACCAAtATTGA 
TTTTG AC CATCCTG ATTATTTTAGA.GG CCTAGAGGACGTATTCAATGCCTTTAATGACT 
TGCT AAG CAAGTTCAAAAAGGTTTATT CATCTATGGAGAAGATC CAAAACTTCATGAAAT 
CACT T CTGAGGCAC CAATATATTATTATGGTTTTGAAGATTCAAATGATTTT AT AGCAAA 
AGACATCACTCGAACTGTTAA.TGGTTCTGAC"lUTAAGGTlTT'CrATAACCAAGAAGAAAT 
TGGT CAGTT T CATGTACCAG CATACGGTAAACAT AAT AT CTT AAATGCAACTGCTGTTAT 
TGCTAACCTTTAC^TAATGGGAATTGATATGGC^TTAGTAGCTGAGCATTTGAAGACATT 
TT CAG GGGT AAAGCGT CGTTCTACTGAG AAGACT ACTGACGATACTGT CATT ATTGATGA 
CTTTG CT CACCATC CTACTGAGATTATTGCGACATTAGATG CTG CT CGACAAAAATACCC 
GT CAAAAGAAATTGT AG CTATTTT C CAAC CGCATACGTTCACTC GTACGATAGCT CTTTT 
AGACGAATTTG C CCATG CCTTGAGTCAAGCGGAT AG CGTTTATCT CG CT CAAATATATGG 
TTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAAGATTTAGCTGCTAAGATTGT 
C^^CACTCAGATCTAGTGACAGT CGAAAATGT CTCG C CTTTACTCAAT CATGAT AATGC 
TGTCTATGTCTCTATGGGTG CTGGAGACACTCAACTGTAT ^ 
ATTAG CTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4602 
STRAIN 1169NT 

AAAAGCAGGCTCTAGTGACGTTGACAAATATTATTT^ 

AGGTGTAACTATATTAC CTTTCTCACCGAATAATATCAGTGAGGATTTAGAGATT ATTG C 
AGGAAATGCTTCTCGTC CAGATAACAATGAAGAGTTGGCTT^ 
TCATTTTAAACGATATCATGAATTTCTCGGAGATTTTATGCG 
TGTAGCTGGGGCACATGGAAAAACCTCAACGACAGG 

TATTACAGACACTTC'iTT CCT AATTGG AGATGGT ACAGGACGTGGTT CTG CTAATG CTAA 
TTACTTTGTGTTTGAAGCTGATGAATACGA 

CT CAATTATT AC CAATATTG ATTTTGAC CAT CCTG ATTATTTTACAGGC CTAGAGGACGT 
ATTCAATGCCTTTAATGACTATGCTAAGCAAGTTCAA 

AGAT C CAAAACTT CATG AAATCACTTCTG AG<3CACCAATATATT ACTATGG'CT 
TTCAAATGATTTTATAG CAAAAGACAT CACT CGAACTGCTAATGGCT CTGACTTT AAGGT 
TTTCTATAACCAAGAAGAAACTGGTCAGTCTCATGTACCAGCAT^ 
CTTAAATGCAACTGCTGCTACTGCTAACCTTTACA^ 

AGCTGAG CATTTGAAGACATTTTCAGGGGTAAAG CGT CGTTTTACTG AGAAGATTATTGA 
CGATACTGTCACTATTGATGACTTTGCTCACCATCCTACTG^ 

TG CTG CT CGACAAAAATACCCGTCAAAAGAAATTGTAG CTATTTTCCAACCG CATACGTT 
CACT CGT ACGAT AGCTCCTTTAGACGAATCTG CC CATG CCT^ 

CTATCTCGCTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGA 
AGATTTAGCTG CTAAGACTGTCAAACACTCAGATTTAGTGACAGT CG 
TTTACTCAATCATGATAATG CTGTCTATGTCTTTATGGGTG CTGG AG ACATT CAATTGTA 
TGAG CGCTCTTTTG AAG AATTATTAG CTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4603 
STRAIN 090 

AAAGCAGGCTCTAGTGACGTTGACAAATACTATTTT^ 

GGTGTAACTATACTACCTTTCTCACCX3AATAATATCAGTGAGGATCTAG 

GGAAATG CTTTT CGTCCAGATAACAATGAAGAGTTGGCTTATGCT ATTG AAAA.GGGCTAT 

CATTTTAAA CG ATATCATG AATCT CTCGGAGATTTTATGCGT CAGTT CACTAGTCTAGGT 

GTAG CTGGGG CACATGGAAAAACCTCAACGACAGGTCT ACT AGCTCA 

ATTACAGACACTTCTTTCCTAACTGGAGATGGTACAGGACGTGGCTCT 

TACTTTGTGTTTGAAG CTG ATGAAT ACGAACGTCATTTT ATG C C GTACCATC CAGAATAC 

TGAACTACTACCAATACTGATTTTGAC CATCCTGACTATTTTACAGG CCTAGAGGACGTA 

CTCAATGCTTCTAATGACTATGCTAAG(ZAAGCTCAAA 

GATT CAAAACTTCATGAAAT CACTTCTAAGG CACCAATATACTATTATGGTTTTGAAGAT 
TCAAATGATTTTATAG CAAAAGACAT CACT CG AACTGCTAATGGTT CTGACTTTAAGGTT 
CTCTATAACCAAGAAGAAATTGGTCAGTTT CATGTACCAG CATACGGTAAACAT AAT AT C 
TTAAATG CAACTGCTGTTATTG CTAAC CTTTACATAATGGGAATTGATATGG CATTAGTA 
GCTGAG CATTTGAAG ACATTTT CAGGGGTAAAACGTCGTTCT ACTGAGAAGATT ATTGAC 
GATACTGTCATTACTGATGACTCTGCTCACCATCCTACTGAGACT 

GCTG CTCGACAAAAAT ACC CGT CAAAAGAAATTGT AG CTATTTTC CAAC CG CATACGTT C 
ACTCGTACGATAG CT(CTITTAGACGATTTTGC CCATGCTTTG AGT CAAGCGGATAG CGTT 
TATCTTG CT CAAAT ATATG GTT CT G CTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAA 
GATTTAGCTGCTAAGATTGTCAAACACTCAGATCTAGTGACAGTCG 
CTACTCAATGATGATAATGCTGTCTATGTCTCTATGGGTGCT 
GAGCGCT'CTT H 1T'GAAGAACTACTAGCTAACCTAACTAAAAATACA.CU\A 

SEQ ID NO. 4604 
STRAIN H36B 

AAAAG CAGGCT CTAGTgACGCTgACAAATAT t ATTTTACT CAACGTGGTT t AGAG CAAG CAGGT 
ATAACTATACTACCTTTCT CAC CGAATAATAT CAGTGAGGATTTAGAGATTATTG CAGG A 
AATGCTTCTCGT CCAGATAACAATGAAGAGTTGG CTTATGTTATTGAAAAGGGCTAT CAT 
TCTAAACGATATCATGAATCT CTCGGAGATTCT 
GCTGGGGCACATGGAAAAACCTCAACX3ACAGGTCT 

AGAGACACTTCTCT CCTAACTGGAGATGGTACAGGACGTGGCT CTG CTAATG CT 

TTTGTGTCTGAAGCTGATGAATACGAACGTCATTCTATGCCGTACCATCCAGAATA 

ACTACTACGAATACTGATTTTGACCATCCTGACTATTCT 

AATG CTTTTAATGACTATGCTAAG CAAGCTCAAAAAGGTTTATT CATTTATGGAGAAGAT 






CCAAAACTTC^TGAAATCACTTCTGAGGC 

AATG ATT TT AT AG CAAAAG AT ATCACT CG AACTGTT AAT GGTT CT GACTTT AAG GTTTT C 
TATAACGAAGAAGAAATTGGTCAGTTTCACGTACCAGCATACGGTAAACATAATATCTTA 
AATGCAACTGCTGTTATTGCTAACCITTACATAATGGGAATTGATATGGCATTAGTAG 
GAGCATTTGAAGACATTTTCAGGGGTAAAAaSTCGTTTTACTGAG 

ACTGTCATTATTGATGACTTTGCTCAC CAT CCTACTGAGATTATTG CGACATTAG ATGCT 
GCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATTTTCCAACCGCATACGTTCACT 
CXjTACGATAGCTCTTTTAGACGAATTTGCCCATGCC 

CT CG CTCAAATATATGGTT CTG CTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGAAGAT 
TT AG CTG CTAAGATTGTCAAACACT CAGATTTAGT GACAGTCGAAAATGT CT CG CCTTTA 
CTCAATC^TGATAATGCTOTCTATGTCTT^ATGGGTGCT 
CGCTCTTTTGAAGAATTATTAGCTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4605
STRAIN 18RS21 

AAAGCAGG CT CTAGTGACGTTGACAAATATTATTTTACC CAACGTGGTTT AG AG CAAG CA 
GGTGTAACTATATTACCTTTCTCACCGAATAATATCAGTGAGGA 

GGAAATG CTTTT CGT CCAGATAACAATGAAGAGTTGG CTT ATGTTATTGAAAAGGGCTAT 

CATTTTAAACGATATCATGAATTTCTCGGAGATTTT^ 

GTAGCTGGGGCACATGGAAAAACCrCAACGACAGG 

ATTACAGACACITCTTTCCT AATTGGAGATGGTACAGGAC G CTAATG CTAAT 

TACTTTCTGTTTGAAGCTGATGAATACGAACGTCATTTTATGC^ 

TCAATTATTACCAATATTGATTTTGACC^ 

TT CAATG CCTTT AATGACTATG CTAAG CAAGTTCAAAAAGGTTTATTCATTTATGGAGAA 
GATC CAAAACIT CATGAAATCACTTCTGAGG CACCAATATATTATTATGGTTTTGAAGAT 
T CAAATGATTTTATAG CAA7VAGACATCACTCGAACTGTTAATGGTT CTGACTTTAAGGTT 
TTCT ATAAC CAAGAAGAAATTGGT CAGTTT CATGTACCAGCATACGGTAAACAT AATATC 
TTAAATG CAACTGCTGTTATTG CTAACC"I"i' '1 ACATAATGGGAATTGATATGG CATTAGTA 
GCTGAGCATTTGAAGACGTTTTCAGGGGTAAAGCGTCGTTT^ 
GATACTGTCATTATTGATGACITTGCrCACCATCCrACT 
GCTGCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATT^ 

ACTCGTACGATAGCT CTTTT AGACGAATTTG C CCATG C CTTG AGT CAAG CGGATAGCGTT 
TATCT CG CT CAAATATATGGTTCTG OTAGAGAAGTAGAT AATGGTGAGGTGAAGGTAGAA 
GATTTAGCTGCTAAGATTGTCAAACACTCAGATTTAGTGACAGTCG 
TTACTCAATCATGATAATG CTGTCTATGT CTTTATGGGTG CTGG AGACA7TCAATTGTAT 
GAGCG CT CrTTTGAAGAATTATTAG CTAAC CTAACTAAAAATACACAA 

SEQ ID NO. 4606 
STRAIN M732 

AAAAGCAGG CTCTAGTGACGT t GACAAATA t TATTTTAC CCAACGTGGTTT AGAG CAAG CAG 
GTGTAACTATATTACCTTTCTCACCGAATAATATCAGTGAGGATTTAGAG 
GAAATGCTTTT CGT CCAGATAACAATGAAG AGTTGGCTT^ CTATC 
ATTTTAAACGATATaATGAATTTCTCGGAGA 

TAG CTGGGGCACATGGAAAAACCT CAACGACAGGTTTATTAG CT CATGTTTT AAAAAATA 
TTACAGACACJ.TC'r IT C CTAATTGGAGATGGTACAGGACGTGGTTCTG CTAATG CTAATT 
ACTTTGTGTTTGAAG CTGATGAATACGAACGT CATTTTATG C CGT ACC^T CCAGAATACT 
CAATTATTACCAATATTGATTTTGAC CAT CCTGATTATTTTACAGGCCTAGAGGACGTAT 
TCAATGCCTTTAATGACTATGCTAAGCAAGTTCAAAAAGGT^ 

ATCCAAAACTT CATGAAATCACTTCTGAGG CAC CAATATATTATTATGGT TTTGAAGATT 
CAAATGATTTTATAG CAAAAGACAT CACT CGAACTGTTAATGGTTCTGACT^ 
TCTATAACCAAGAAGAAATTGGTCAGTTT CATGTACCAG CATACGGTAAACATAATATCT 
TAAATGCAACTG CTGTTATTG CTAACCTTTACATAATG^ 

CTGAG CATTTGAAGACATTTTCAGGGGTAAAG CGTCGTTTTACTG AGAAGATTATTG ACG 
ATACTGrTCATTATTGATGACrTTTGCT 

CTG CT CGACAAAAATAC CCGTCAAAAGAAATTGTAG CTATTTT CCAACCG CATACGTTCA 
CTCGTACGATAGCTCTTTTAGACSA^ 

ATCTCGCTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAgGTAGAAG 

ATTTAG CTG CTAAgATTGT CAAACACTCAGATTTAGTGACAGTCGAAAA.TGTCT CGC CTT 

TACTGAATCATGATAATGCTGTCTATGTCTTTATGGGTGCT 

AG CG CT CTTTTGAAGAATTATTAG CTAAC CTAACTAAAAATACACAA 

SEQ ID NO. 4607 
STRAIN M781 

AAAG CAGG CTCTAGTGACGT t GACAAATATT ATTTTAC C CAACGTGGTTTAG AGCAAGCAG 
GTGTAACTATATTACCTTTCTCACCGAATAATATCAGTGAGGATTTAGAGA 
GAAATGCTTTTCGT C CAGATAACAATGAAGAGTTGGCTTATGTTATTGAAAAGGG CTATC 
ATTTT AAACGATAT CATGAATTTCT CGGAGATTTTATGCGTCAGTT CACT AGTCTAGGT 
GTAGCTGGGGCACATGGAAAAACCT CAACGACAGGTTTAT^ 

TATTACAGACACTTCTTTC CTAATTGG AGATGGTACAGGACGTGGTTCTG CT AATGCTAA 
TT ACTTTGTGTTTGAAG CTGATGAATACGAACGTCATTTTATGCCGTAC CAT CCAGAATA 
CT CAATTATTACCAATATTG ATTTTGAC CATCCTGATTATTTTACAGGC CTAGAGGACGT 
ATTCAATGC CTTTAATGACTATG CTAAG CAAGTTCAAAAAGGTTTATTCATTTATGGAGA 
AGAT CCAAAACTTCATGAAAT CACTTCTGAGG CACCAATATATT ATTATGGTTTTGAAGA 
TT CAAATGATTTTATAG CAAAAGACATCACT CGAACTGTTAATGGTT CTG ACTTTAAGGT 
TTTCTATAAC CAAG AAGAAATTGGT CAGTTT CATGTAC CAGCAT ACGGT AAACATAATAT 
CTTAAATG CAACTG CTGTTATTG CTAACCTTTACATAATGGGAATTGAT^ 
AG CTG AG CATTTGAAGACATTTTCAGGGGTAAAG CGT CGTTTTACTG AG AAGATTATTGA 
CG ATACTGT CATTATTGATG ACTTTG CTCACCAT C CTACTGAGATTATTG CGACATTAG A 
TGCTGCTCCJACAAAAATACCCGTCAAAAGAAATTGTAGCT^ 

CACT CGTACGATAG CTCTTTTAGACGAATTTG CC CATGCCT^ CGGATAG CGT 

TTATCTCGCrrCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAAGGTAGA 
AGATTTAGCTGCTAAGATTGTCAAACACTCAGATTTAGTGACAGTCGAAAA 
TTTACTC^\ATCATC4ATAATG CTGTCTATGTCTTTATGGGTGCT CAATTGT A 



TGAGCGCTCITTTGAAGAATTATTAGCTAACCTAACTAAAAATACACAA 

SEQ ID NO. 4608 
STRAIN CJB110 

AAAAAGCAGGCTCTAGTGACGTtGACAAATAt.TATTTrACCCAACGTGGTTTAGAGCAA 
GGTGTAACTATATTACCTTT CTCAC CGAATAATAT CAGTG AGGATTT AG AGATTATTG CA 
GG AAATGCTTTT CGT CCAG AT AACAATGAAGAGTTGG CTTATGTTATTG AAAAGGG CTAT 
CATTTTAAACGATATCATGAATTTCrCGGAGATTTTATGCGTCAGTTCACTAGTCTAGGT 
GT AG CTGGGG CACATGGAAAAACCT CAACGACAGGTTTATTAG CT 
ATTACAGACACTTCTTTCCTAATTGGAGAT 

TACTTTGTGTTTGAAGCTGATGAATACGAACGTCATTTTATGCCGTACCATCCAGAATAC 
TCAAT TATTACCAAT ATTG ATTTTGAC GAT CCTG ATTATTTT ACAGG CCT AG AGG ACGT A 
TT CAATG CTTTTAATG ACTATG CTAAG CAAGTTCAAAAAGGTTTATTCATTTATGGAGAA 
GATTCAAAACTT CATGAAAT CACTTCTAAGGCACCAATATATTATTATGGTTTTGAAGAT 
TGAAATGATTTTATAGCAAAAGACATGA.CTCX3AACTGTTAATGGTTCT 
TT CT AT AACCAAGAAGAAATTGGT CAGTTTCATGTAC CAG CATACGGTAAACATAAT ATC 
TT AAATG CAACTGCTGTTATTG CT AAC CTTTACATAATG G GAATTGATATGG CATTAGTA 
G CTG AGCATTTGAAG ACATTTTCAGGGGTAA^CGT CG1T1T ACTGAGAAGATTATTGAC 
GATACTGTGATTATTGATGACITTGCT 

GCTGCTCGACAAAAATACCCGTCAAAAGAAATTGTAGCTATTTTCCAACCGCATACGT^ 

ACTCGTACGATAGCTCITTTAGACGATTT^ 

TATCTTGCTCAAATATATGGTTCTGCTAGAGAAGTA 

GATTTAGCTG CTAAG ATTGT CAAACACTCAGATTT AGTGACAGT CGAAAATGTCTCG CCT 
TTACTCAAT(2ATGATAATGCTGTCTATGTCTTT^ 

GAGCG CT CTTTTGAAGAATTATTAGCTAAC CTAACTAAAAATACACAA 
SEQ ID NO. 4609 

STRAIN JM9130013 (reverse complement) 

GTTGAAAAAAGCAGGCTCTAGTGACGTTGACAAATATTATTTTACT 

GCAAGCAGGTATAACTATATTACCTTTCTCACCGAA 

TATTGCAGGAAATGCTTTTCGTCCAGATAACAATGAAG^ 

GGGCT AT CATTTTAAACGATATCAT GAATTT CT CGGAGATTTTATG CGT CAGTT CACTAG 
T CTAGGTGTAGCTGGGGCACATGGAAAAA.C CT CAACGACAGGTTTATTAG CTCATGTTTT 
AAAAAATATTACAGACACTTCTTTCCTAATTGGA 

TG CTAATTACTTTGTGTTTGAAGCTGATG AATACGAACGT CATTTTATG C CGT AC CATCC 
AGAATACTO\ATTATTACCAATATTGATTT^ 

GGACGTATTCAATGCTTTTAATGACTATG CTAAG CAAGTT CAAAAAGGTTTATT CATTT A 
TG GAGAAGAT CCAAAA.CTT CATGAAAT CACTTCTGAGG CACCAATATATTATTATGGTTT 
TG AAG ATTCAAATG ATTTT ATAGCAAAAGAT AT CACT CGAACTGTTAATGGTTCTGACTT 
TAAGGTTTT CTATAACCAAGAAGAAATTGGTCAGTTT CACGTAC CAG CATACGGTAAACA 
T AATATCTT AAATGCA^CTG CTGTTATTG CTAACCTTTACATAATGGGAATTGAT ATGG C 
ATTAGTAGCTGAGCATTTGAAGACATTTTCAGGGGTAAAACGTCG1T 
TATTGACGATACTGTCATTATTGATGACTTTGCTCACCA 

ATTAGATG CTG CTCGACAAAAATACCCGT CAAAAGAAATTGT AGCTATTTTC CAACCGCA 
TACGTTCACT CGTACGATAG CTCTTTTAGACGAATTTGC CCATG CCTTGAGT CAAGCGGA 
TAGCGTTTATCTCGCTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTGAGGTGAA 
GGTAGAAGATTTAG CTG CTAAGATTGT CAAACACT CAGATTTAGTGACAGTCGAAAATGT 
CT CG C CTTT ACT CAATCATGAT AATGCTGTCTATGT CTTT ATGGGTG CTGGAGACATTCA 
ATTGTATGAGCGCTCTTTTGAAGAATTATTAGCT 

SEQ ID NO. 4610 

STRAIN COH1 reverse complement 

CAGGCTCTAGTGACGTGACAAATATtATTTTACCCAACGTGGTTAGAGCAAGC^ 

CTATATTACCTTTCTCACCGAATAATATC^ 
CTTTTCGTCCAGATAACAATGAAGAGTTGGCTTAT 

AACGATATCATGAATTTCTCGGAGATTTTATG CGTCAGTT CACTAGTCTAGGTGTAGCTG 

GGGCACATGGAAAAACCTCAACGAC^GGTTTATTAGCT 

ACACTTCITTCCTAATTGGAGATGGTACAGGACGTGGTTCT 

TGTTTGAAG CTGATGAATACGAACGTCATTTTATGCCGTACCATCCAGAATACT CAATT A 
TTAC CAATATTG ATTTTGACCATCCTGATTATTTTACAGGCCTAGAGGACGT ATT CAATG 
CCTTT AATGACTATG CTAAG CAAGTTCAAAAAGGTTTATT CATTTATGGAGAAGATC CAA 
AACTT CATGAAA.TCACTT CTGAGG CACCAATATATT ATT ATGGTTTTGAAGATT CAAATG 
ATTTTATAG CAAAAGACAT CACTCG AACTGTTAATGGTT CTGACTTTAAGGTTTT CTATA 
ACCAAGAAGAAATTGGTCAGTTTCATGTACCAGCAT 
CAACTGCTGTTATTGCTAACCTTTACATAATGGGAATTGATAT 
ATTTGAAGACATTTTCAGGGGTAAAGCGTCGTTTTACTG 
TCATTATTGATGACTTTGCTCACCATCCTACT 

GACAAAAATAC CCGT CAAAAGAAATTGTAG CTATTTT CCAAC CG CATACGTTCACT CGTA 
CXSATAGCTCTTTTAGACGAATTTGCCCA^ 

CTCAAATATATGGTT CTG CTAGAGAAGTAGATAATGGTGAGGT^ 
CTGCTAAGATTGTCAAACACTCAGATTTAGTGACAGTCGAA 
ATCATGATAATGCTGTCTATGTCTTTATGG^TGCrGGAGACATT 
CrTTTG^GAATTATTAGCTAACCn'AACTAAAAATACAC^^ 

SEQ ID NO. 4611 
STRAIN 2603 

atgtcaaaaacttatcattttattggtattaaaggatccggaatgagtgccctagcactg 
atgcttcatcaaatgggacataacgtccaaggaagtgacgttgacaaatattattttacc 
caacgtggtttagagcaagcaggtgtaactatattacctttctcaccgaataatatcagt 
gaggatttagagattattgcaggaaatgcttttcgtccagataacaatgaagagttggct 
tatgttattgaaaagggctatcaatttaaacgatatcatgaatttctcggagattttatg 
cgtcagttcactagtctaggtgtagctggggcacatggaaaaacctcaacgacaggttta 
ttagctcatgttttaaaaaatattacagacacttctttcctaattggagatggtacagga 



cgtggttctgctaatgctaattactttgtgtttgaagctgatgaatacgaacgtcatttt 
atgccgtaccatccagaatactcaattattaccaatattgattttgaccatcctgattat 
tttacaggcttagaggacgtattcaatgcctttaatgactatgctaagcaagttcaaaaa 
ggtttattcatttatggagaagatccaaaacttcatgaaatcacttctgaggcaccaata
tattattatggttttgaagattcaaatgattttatagcaaaagacatcactcgaactgtt 
aatggttctgactttaaggttttctataaccaagaagaaattggtcagtttcatgtacca 
gcatacggtaaacataatatcttaaatgcaactgctgttattgctaacctttacataatg 
ggaattgatatggcattagtagctgagcatttgaagacgttttcaggggtaaagcgtcgt
tttactgagaagattattgacgatactgtcattattgatgactttgctcaccatcctact 
gagattattgcgacattagatgctgctcgacaaaaatacccgtcaaaagaaattgtagct 
attttccaaccgcatacgttcactcgtacgatagctcttttagacgaatttgcccatgcc 
ttgagtcaagcggatagcgtttatctcgctcaaatatatggttctgctagagaagtagat 
aatggtgaggtgaaggtagaagatttagctgctaagattgtcaaacactcagatttagtg 
acagtcgaaaatgtctcgcctttactcaatcatgataatgctgtctatgtctttatgggt 
gctggagacattcaattgtatgagcgctcttttgaagaattattagctaacctaactaaa 
aatacacaa 

SEQ ID NO. 4612 

STRAIN COH1 reverse complement 

CAGGCTCTAGTGACGTtGACAAATAtTATTTTACCCAACGTGGtTTAGAGCAAGCAGGTGTAA 
CTATATTACCTTTCTCACCGAATAATATCAGTGAGGATTTAGAGATTATTGCAGGAAA^ 
CTTTTCGTCCAGATAACAATGAAGAGTTGGCTTATGTTATTGAAAAGGGCTATCATTTTA 
AACGATATCATGAATTTCTCGGAGATTTTATGCGTCAGTTCACTAGTCTAGGTGTAGCTG 
GGGCACATGGAAAAACCTCAACGACAGGTTTATTAGCT 

ACACTTCTTTCCTAATTGGAGATGGTACAGGACGTGGTTCTGCTAATGCTAATTACTTTG 
TGTTTGAAGCTGATGAATACGAACGTCATTTTATGCCGTACCATCCAGAATACTCAATTA 
TTACCAATATTGATTTTGACCATCCTGATTATTTTACAGGCCT 

CCTTTAATGACTATGCTAAGCAAGTTCAAAAAGGTTTATTCATTTATGGAGAAGATCCAA 
AACTTCATGAAATCACTTCTGAGGCACCAATATATTATTATGGTTTTGAAGATTCAAATG 
ATTTTATAGCAAAAGACATCACTCGAACTGTTAATGGTTCTGACTTTAAGGTTTTCTATA 
ACCAAGAAGAAATTGGTCAGTTTCATGTACCAGCATACGGTAAACATAATATCTTAAATG 
CAACTGCTGTTATTGCTAACCTTTACATAATGGGAATTGAT^ 

ATTTGAAGACATTTTCAGGGGTAAAGCGTCGTTTTACTGAGAAGATTATTGACGATACTG 
TCATTATTGATGACTTTGCTCACCATCCTACTGAGATTATTGCGACATTAGATGCTGCTC 
GACAAAAATACCCGTCAAAAGAAACTGTAGCTATTCTC 

CGATAGCTCTTTTAGACGAATTTGCCCATGCCTTGAGTCAAGCGGATAGCGTTTATCTCG 
CTCAAATATATGGTTCTGCTAGAGAAGTAGATAATGGTCAG^ 
CTGCTAAGATTGTCAAACACTCAGATTTAGTGACAGTCGAAA^ 

ATCATGATAATGCTGTCTATGTCTTTATGGGTGCTGGAGACATTCAATTGTATGAGCGCT 
CTTTTGAAGAATTATTAGCTAACCTAACTAAAAATACACAA 

PRETTY of: /biotmp/msa56524.2{*} November 26, 2002 08:06 
PRETTY of: /biotmp/msa253045.2{*} January 31, 2003 03:51 



                            1                                                   50
msa253045.2{157_090}
msa253045.2{157_CJB110}
msa253045.2{157_H36B}
msa253045.2{157_JM9130013}
msa253045.2{157_1169NT}
msa253045.2{157_A909}
msa253045.2{157_COH1}
msa253045.2{157_M732}
msa253045.2{157_M781}
msa253045.2{157_18RS21}
msa253045.2{157_2603}       atgtcaaaaa cttatcattt tattggtatt aaaggatccg gaatgagtgc
Consensus                   ********** ********** ********** ********** **********



msa253045.2{157_090}
msa253045.2{157_CJB110}
msa253045.2{157_H36B}
msa253045.2{157_JM9130013}
msa253045.2{157_1169NT}
msa253045.2{157_A909}
msa253045.2{157_COH1}
msa253045.2{157_M732}
msa253045.2{157_M781}
msa253045.2{157_18RS21}
msa253045.2{157_2603}

Consensus 



51 



100 

-aaagcaggc tctagtgacg 

A Aaaagcaggc tctagtgacg 

Aaaagcaggc tctagtgacg 

'GttcaA Aaaagcaggc tctagtgacg 
. Aaaagcaggc tctagtgacg 



. — caggc tctagtgacg 

Aaaagcaggc tctagtgacg 

— -aaagcaggc tctagtgacg 

„ — ,~ -aaagcaggc tctagtgacg 

cctagcactg atgcttcatc aaatGggacA taacgtccaa ggaagtgacg 



********** ********** ********** * 



                            101                                                150
msa253045.2{157_090}        tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_CJB110}     tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_H36B}       tTGACAAATA TTATTTTACt CAACGTGGTT TAGAGCAAGC AGGTaTAACT
msa253045.2{157_JM9130013}  tTGACAAATA TTATTTTACt CAACGTGGTT TAGAGCAAGC AGGTaTAACT
msa253045.2{157_1169NT}     tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_A909}       -TGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_COH1}       tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_M732}       tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_M781}       tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_18RS21}     tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
msa253045.2{157_2603}       tTGACAAATA TTATTTTACc CAACGTGGTT TAGAGCAAGC AGGTgTAACT
Consensus                   _********* *********_ ********** ********** ****_*****



                            151                                                200
msa253045.2{157_090}        ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_CJB110}     ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_H36B}       ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_JM9130013}  ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_1169NT}     ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_A909}       ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_COH1}       ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_M732}       ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_M781}       ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_18RS21}     ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
msa253045.2{157_2603}       ATATTACCTT TCTCACCGAA TAATATCAGT GAGGATTTAG AGATTATTGC
Consensus                   ********** ********** ********** ********** **********



                            201                                                250
msa253045.2{157_090}        AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_CJB110}     AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_H36B}       AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_JM9130013}  AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_1169NT}     AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_A909}       AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_COH1}       AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_M732}       AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_M781}       AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_18RS21}     AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
msa253045.2{157_2603}       AGGAAATGCT TTTCGTCCAG ATAACAATGA AGAGTTGGCT TATGTTATTG
Consensus                   ********** ********** ********** ********** **********



                            251                                                300
msa253045.2{157_090}        AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_CJB110}     AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_H36B}       AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_JM9130013}  AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_1169NT}     AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_A909}       AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_COH1}       AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_M732}       AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_M781}       AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_18RS21}     AAAAGGGCTA TCAtTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
msa253045.2{157_2603}       AAAAGGGCTA TCAaTTTAAA CGATATCATG AATTTCTCGG AGATTTTATG
Consensus                   ********** ***_****** ********** ********** **********



                            301                                                350
msa253045.2{157_090}        CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_CJB110}     CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_H36B}       CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_JM9130013}  CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_1169NT}     CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_A909}       CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_COH1}       CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_M732}       CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_M781}       CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_18RS21}     CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
msa253045.2{157_2603}       CGTCAGTTCA CTAGTCTAGG TGTAGCTGGG GCACATGGAA AAACCTCAAC
Consensus                   ********** ********** ********** ********** **********



                            351                                                400
msa253045.2{157_090}        GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_CJB110}     GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_H36B}       GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_JM9130013}  GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_1169NT}     GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_A909}       GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_COH1}       GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_M732}       GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_M781}       GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_18RS21}     GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
msa253045.2{157_2603}       GACAGGTTTA TTAGCTCATG TTTTAAAAAA TATTACAGAC ACTTCTTTCC
Consensus                   ********** ********** ********** ********** **********



                            401                                                450
msa253045.2{157_090}        TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_CJB110}     TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_H36B}       TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_JM9130013}  TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_1169NT}     TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_A909}       TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_COH1}       TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_M732}       TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_M781}       TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG






WO 2004/018646 



PCT/US2003/026827 






msa253045.2{157_18RS21}     TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
msa253045.2{157_2603}       TAATTGGAGA TGGTACAGGA CGTGGTTCTG CTAATGCTAA TTACTTTGTG
Consensus                   ********** ********** ********** ********** **********



                            451                                                500
msa253045.2{157_090}        TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_CJB110}     TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_H36B}       TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_JM9130013}  TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_1169NT}     TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_A909}       TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_COH1}       TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_M732}       TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_M781}       TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_18RS21}     TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
msa253045.2{157_2603}       TTTGAAGCTG ATGAATACGA ACGTCATTTT ATGCCGTACC ATCCAGAATA
Consensus                   ********** ********** ********** ********** **********



                            501                                                550
msa253045.2{157_090}        CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_CJB110}     CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_H36B}       CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_JM9130013}  CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_1169NT}     CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_A909}       CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_COH1}       CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_M732}       CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_M781}       CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCc
msa253045.2{157_18RS21}     CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCt
msa253045.2{157_2603}       CTCAATTATT ACCAATATTG ATTTTGACCA TCCTGATTAT TTTACAGGCt
Consensus                   ********** ********** ********** ********** *********_

                            551                                                600
msa253045.2{157_090}        TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_CJB110}     TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_H36B}       TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_JM9130013}  TAGAGGACGT ATTCAATGCt TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_1169NT}     TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_A909}       TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_COH1}       TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_M732}       TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_M781}       TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_18RS21}     TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
msa253045.2{157_2603}       TAGAGGACGT ATTCAATGCc TTTAATGACT ATGCTAAGCA AGTTCAAAAA
Consensus                   ********** *********_ ********** ********** **********



                            601                                                650
msa253045.2{157_090}        GGTTTATTCA TTTATGGAGA AGATtCAAAA CTTCATGAAA TCACTTCTaA
msa253045.2{157_CJB110}     GGTTTATTCA TTTATGGAGA AGATtCAAAA CTTCATGAAA TCACTTCTaA
msa253045.2{157_H36B}       GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_JM9130013}  GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_1169NT}     GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_A909}       GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_COH1}       GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_M732}       GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_M781}       GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_18RS21}     GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
msa253045.2{157_2603}       GGTTTATTCA TTTATGGAGA AGATcCAAAA CTTCATGAAA TCACTTCTgA
Consensus                   ********** ********** ****_***** ********** ********_*



                            651                                                700
msa253045.2{157_090}        GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_CJB110}     GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_H36B}       GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_JM9130013}  GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_1169NT}     GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_A909}       GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_COH1}       GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_M732}       GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_M781}       GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_18RS21}     GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
msa253045.2{157_2603}       GGCACCAATA TATTATTATG GTTTTGAAGA TTCAAATGAT TTTATAGCAA
Consensus                   ********** ********** ********** ********** **********



                            701                                                750
msa253045.2{157_090}        AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_CJB110}     AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_H36B}       AAGAtATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_JM9130013}  AAGAtATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_1169NT}     AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_A909}       AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_COH1}       AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_M732}       AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC






msa253045.2{157_M781}       AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_18RS21}     AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
msa253045.2{157_2603}       AAGAcATCAC TCGAACTGTT AATGGTTCTG ACTTTAAGGT TTTCTATAAC
Consensus                   ****_***** ********** ********** ********** **********



                            751                                                800
msa253045.2{157_090}        CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_CJB110}     CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_H36B}       CAAGAAGAAA TTGGTCAGTT TCAcGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_JM9130013}  CAAGAAGAAA TTGGTCAGTT TCAcGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_1169NT}     CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_A909}       CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_COH1}       CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_M732}       CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_M781}       CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_18RS21}     CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
msa253045.2{157_2603}       CAAGAAGAAA TTGGTCAGTT TCAtGTACCA GCATACGGTA AACATAATAT
Consensus                   ********** ********** ***_****** ********** **********



                            801                                                850
msa253045.2{157_090}        CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_CJB110}     CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_H36B}       CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_JM9130013}  CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_1169NT}     CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_A909}       CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_COH1}       CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_M732}       CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_M781}       CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_18RS21}     CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
msa253045.2{157_2603}       CTTAAATGCA ACTGCTGTTA TTGCTAACCT TTACATAATG GGAATTGATA
Consensus                   ********** ********** ********** ********** **********



                            851                                                900
msa253045.2{157_090}        TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT
msa253045.2{157_CJB110}     TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT
msa253045.2{157_H36B}       TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT
msa253045.2{157_JM9130013}  TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAaCGTCGT
msa253045.2{157_1169NT}     TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT
msa253045.2{157_A909}       TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT
msa253045.2{157_COH1}       TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT
msa253045.2{157_M732}       TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT
msa253045.2{157_M781}       TGGCATTAGT AGCTGAGCAT TTGAAGACaT TTTCAGGGGT AAAgCGTCGT
msa253045.2{157_18RS21}     TGGCATTAGT AGCTGAGCAT TTGAAGACgT TTTCAGGGGT AAAgCGTCGT
msa253045.2{157_2603}       TGGCATTAGT AGCTGAGCAT TTGAAGACgT TTTCAGGGGT AAAgCGTCGT
Consensus                   ********** ********** ********_* ********** ***_******



                            901                                                950
msa253045.2{157_090}        TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_CJB110}     TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_H36B}       TTTACTGAGA AaATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_JM9130013}  TTTACTGAGA AaATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_1169NT}     TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_A909}       TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_COH1}       TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_M732}       TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_M781}       TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_18RS21}     TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
msa253045.2{157_2603}       TTTACTGAGA AgATTATTGA CGATACTGTC ATTATTGATG ACTTTGCTCA
Consensus                   ********** *_******** ********** ********** **********



                            951                                               1000
msa253045.2{157_090}        CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_CJB110}     CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_H36B}       CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_JM9130013}  CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_1169NT}     CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_A909}       CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_COH1}       CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_M732}       CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_M781}       CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_18RS21}     CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
msa253045.2{157_2603}       CCATCCTACT GAGATTATTG CGACATTAGA TGCTGCTCGA CAAAAATACC
Consensus                   ********** ********** ********** ********** **********



                            1001                                              1050
msa253045.2{157_090}        CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_CJB110}     CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_H36B}       CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_JM9130013}  CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_1169NT}     CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_A909}       CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_COH1}       CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG






msa253045.2{157_M732}       CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_M781}       CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_18RS21}     CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
msa253045.2{157_2603}       CGTCAAAAGA AATTGTAGCT ATTTTCCAAC CGCATACGTT CACTCGTACG
Consensus                   ********** ********** ********** ********** **********



                            1051                                              1100
msa253045.2{157_090}        ATAGCTCTTT TAGACGAtTT TGCCCATGCt TTGAGTCAAG CGGATAGCGT
msa253045.2{157_CJB110}     ATAGCTCTTT TAGACGAtTT TGCCCATGCt TTGAGTCAAG CGGATAGCGT
msa253045.2{157_H36B}       ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_JM9130013}  ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_1169NT}     ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_A909}       ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_COH1}       ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_M732}       ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_M781}       ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_18RS21}     ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
msa253045.2{157_2603}       ATAGCTCTTT TAGACGAaTT TGCCCATGCc TTGAGTCAAG CGGATAGCGT
Consensus                   ********** *******_** *********_ ********** **********



                            1101                                              1150
msa253045.2{157_090}        TTATCTtGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_CJB110}     TTATCTtGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_H36B}       TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_JM9130013}  TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_1169NT}     TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_A909}       TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_COH1}       TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_M732}       TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_M781}       TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_18RS21}     TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
msa253045.2{157_2603}       TTATCTcGCT CAAATATATG GTTCTGCTAG AGAAGTAGAT AATGGTGAGG
Consensus                   ******_*** ********** ********** ********** **********



                            1151                                              1200
msa253045.2{157_090}        TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_CJB110}     TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_H36B}       TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_JM9130013}  TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_1169NT}     TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_A909}       TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_COH1}       TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_M732}       TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_M781}       TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_18RS21}     TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
msa253045.2{157_2603}       TGAAGGTAGA AGATTTAGCT GCTAAGATTG TCAAACACTC AGATTTAGTG
Consensus                   ********** ********** ********** ********** **********



                            1201                                              1250
msa253045.2{157_090}        ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_CJB110}     ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_H36B}       ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_JM9130013}  ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_1169NT}     ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_A909}       ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_COH1}       ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_M732}       ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_M781}       ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_18RS21}     ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
msa253045.2{157_2603}       ACAGTCGAAA ATGTCTCGCC TTTACTCAAT CATGATAATG CTGTCTATGT
Consensus                   ********** ********** ********** ********** **********



                            1251                                              1300
msa253045.2{157_090}        CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_CJB110}     CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_H36B}       CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_JM9130013}  CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_1169NT}     CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_A909}       CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_COH1}       CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_M732}       CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_M781}       CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_18RS21}     CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
msa253045.2{157_2603}       CTTTATGGGT GCTGGAGACA TTCAATTGTA TGAGCGCTCT TTTGAAGAAT
Consensus                   ********** ********** ********** ********** **********



                            1301                       1329
msa253045.2{157_090}        TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_CJB110}     TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_H36B}       TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_JM9130013}  TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_1169NT}     TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_A909}       TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_COH1}       TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_M732}       TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_M781}       TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_18RS21}     TATTAGCTAA CCTAACTAAA AATACACAA
msa253045.2{157_2603}       TATTAGCTAA CCTAACTAAA AATACACAA
Consensus                   ********** ********** *********



SEQ ID NO. 4613 
STRAIN A909 frame: 2 

DKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGYHFKRYHE 
FLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANANYFVFEAD 
EYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGEDPKLHEI 
TSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNILNATAVI 
ANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLDAARQKYP 
SKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVEDLAAKIV 
KHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4614 
STRAIN 1169NT frame: 2 

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI 
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD 
AARQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVE 
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4615 
STRAIN 090 frame: 1 

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DSKLHEITSKAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI 
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD 
AARQKYPSKEIVAIFQPHTFTRTIALLDDFAHALSQADSVYLAQIYGSAREVDNGEVKVE 
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4616 
STRAIN H36B frame: 2 

KAGSSDVDKYYFTQRGLEQAGITILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI 
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD 
AARQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVE 
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4617 
STRAIN 18RS21 frame: 1 

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI 
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD 
AARQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVE 
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4618 
STRAIN M732 frame: 2 

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY 
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN 
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE 
DPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI 
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD 
AARQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVE 
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ 

SEQ ID NO. 4619 

STRAIN JM9130013 frame: 2

FKKAGSSDVDKYYFTQRGLEQAGITILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEK
GYHFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSAN
ANYFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIY
GEDPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKH
NILNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIAT
LDAARQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVK
VEDLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ

SEQ ID NO. 4620 
STRAIN M781 frame: 1 

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE
DPKLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD
AARQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVE
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ

SEQ ID NO. 4621 
STRAIN CJB110 frame: 3

KAGSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGY
HFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANAN
YFVFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGE
DSKLHEITSKAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNI
LNATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLD
AARQKYPSKEIVAIFQPHTFTRTIALLDDFAHALSQADSVYLAQIYGSAREVDNGEVKVE
DLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ

SEQ ID NO. 4622 
STRAIN 2603 frame: 1 

MSKTYHFIGIKGSGMSALALMLHQMGHNVQGSDVDKYYFTQRGLEQAGVTILPFSPNNIS
EDLEIIAGNAFRPDNNEELAYVIEKGYQFKRYHEFLGDFMRQFTSLGVAGAHGKTSTTGL
LAHVLKNITDTSFLIGDGTGRGSANANYFVFEADEYERHFMPYHPEYSIITNIDFDHPDY
FTGLEDVFNAFNDYAKQVQKGLFIYGEDPKLHEITSEAPIYYYGFEDSNDFIAKDITRTV
NGSDFKVFYNQEEIGQFHVPAYGKHNILNATAVIANLYIMGIDMALVAEHLKTFSGVKRR
FTEKIIDDTVIIDDFAHHPTEIIATLDAARQKYPSKEIVAIFQPHTFTRTIALLDEFAHA
LSQADSVYLAQIYGSAREVDNGEVKVEDLAAKIVKHSDLVTVENVSPLLNHDNAVYVFMG
AGDIQLYERSFEELLANLTKNTQ

SEQ ID NO. 4623 
STRAIN COH1 frame: 3 

GSSDVDKYYFTQRGLEQAGVTILPFSPNNISEDLEIIAGNAFRPDNNEELAYVIEKGYHF
KRYHEFLGDFMRQFTSLGVAGAHGKTSTTGLLAHVLKNITDTSFLIGDGTGRGSANANYF
VFEADEYERHFMPYHPEYSIITNIDFDHPDYFTGLEDVFNAFNDYAKQVQKGLFIYGEDP
KLHEITSEAPIYYYGFEDSNDFIAKDITRTVNGSDFKVFYNQEEIGQFHVPAYGKHNILN
ATAVIANLYIMGIDMALVAEHLKTFSGVKRRFTEKIIDDTVIIDDFAHHPTEIIATLDAA
RQKYPSKEIVAIFQPHTFTRTIALLDEFAHALSQADSVYLAQIYGSAREVDNGEVKVEDL
AAKIVKHSDLVTVENVSPLLNHDNAVYVFMGAGDIQLYERSFEELLANLTKNTQ

PRETTY of: /biotmp/msa56635.2{*} November 26, 2002 08:08



                            1                                                   50
msa253220.2{157_090}                                     kag ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_CJB110}                                  kag ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_1169NT}                                  kag ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_18RS21}                                  kag ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_M732}                                    kag ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_M781}                                    kag ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_COH1}                                      g ssdvDKYYFT QRGLEQAGvT
msa253220.2{157_H36B}                                    kag ssdvDKYYFT QRGLEQAGiT
msa253220.2{157_JM9130013}                             fkkag ssdvDKYYFT QRGLEQAGiT
msa253220.2{157_2603}       msktyhfigi kgsgmsalal mlhqmghnvq gsdvDKYYFT QRGLEQAGvT
msa253220.2{157_A909}                                            DKYYFT QRGLEQAGvT
Consensus                   ********** ********** *******        ****** ********_*



                            51                                                 100
msa253220.2{157_090}        ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_CJB110}     ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_1169NT}     ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_18RS21}     ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_M732}       ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_M781}       ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_COH1}       ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_H36B}       ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_JM9130013}  ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
msa253220.2{157_2603}       ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYqFK RYHEFLGDFM
msa253220.2{157_A909}       ILPFSPNNIS EDLEIIAGNA FRPDNNEELA YVIEKGYhFK RYHEFLGDFM
Consensus                   ********** ********** ********** *******_** **********



                            101                                                150
msa253220.2{157_090}        RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_CJB110}     RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_1169NT}     RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_18RS21}     RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_M732}       RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_M781}       RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_COH1}       RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_H36B}       RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_JM9130013}  RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_2603}       RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
msa253220.2{157_A909}       RQFTSLGVAG AHGKTSTTGL LAHVLKNITD TSFLIGDGTG RGSANANYFV
Consensus                   ********** ********** ********** ********** **********



Table 46: Comparative Sequences relating to SAG1615 (strain info highlighted in BOLD)

                            151                                                200
msa253220.2{157_090}        FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_CJB110}     FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_1169NT}     FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_18RS21}     FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_M732}       FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_M781}       FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_COH1}       FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_H36B}       FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_JM9130013}  FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_2603}       FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
msa253220.2{157_A909}       FEADEYERHF MPYHPEYSII TNIDFDHPDY FTGLEDVFNA FNDYAKQVQK
Consensus                   ********** ********** ********** ********** **********



                            201                                                250
msa253220.2{157_090}        GLFIYGEDsK LHEITSkAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_CJB110}     GLFIYGEDsK LHEITSkAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_1169NT}     GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_18RS21}     GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_M732}       GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_M781}       GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_COH1}       GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_H36B}       GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_JM9130013}  GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_2603}       GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
msa253220.2{157_A909}       GLFIYGEDpK LHEITSeAPI YYYGFEDSND FIAKDITRTV NGSDFKVFYN
Consensus                   ********_* ******_*** ********** ********** **********



                            251                                                300
msa253220.2{157_090}        QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_CJB110}     QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_1169NT}     QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_18RS21}     QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_M732}       QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_M781}       QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_COH1}       QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_H36B}       QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_JM9130013}  QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_2603}       QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
msa253220.2{157_A909}       QEEIGQFHVP AYGKHNILNA TAVIANLYIM GIDMALVAEH LKTFSGVKRR
Consensus                   ********** ********** ********** ********** **********



                            301                                                350
msa253220.2{157_090}        FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_CJB110}     FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_1169NT}     FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_18RS21}     FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_M732}       FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_M781}       FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_COH1}       FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_H36B}       FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_JM9130013}  FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_2603}       FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
msa253220.2{157_A909}       FTEKIIDDTV IIDDFAHHPT EIIATLDAAR QKYPSKEIVA IFQPHTFTRT
Consensus                   ********** ********** ********** ********** **********



                            351                                                400
msa253220.2{157_090}        IALLDdFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_CJB110}     IALLDdFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_1169NT}     IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_18RS21}     IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_M732}       IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_M781}       IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_COH1}       IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_H36B}       IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_JM9130013}  IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_2603}       IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
msa253220.2{157_A909}       IALLDeFAHA LSQADSVYLA QIYGSAREVD NGEVKVEDLA AKIVKHSDLV
Consensus                   *****_**** ********** ********** ********** **********



                            401                                         443
msa253220.2{157_090}        TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_CJB110}     TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_1169NT}     TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_18RS21}     TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_M732}       TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_M781}       TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_COH1}       TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_H36B}       TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_JM9130013}  TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_2603}       TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
msa253220.2{157_A909}       TVENVSPLLN HDNAVYVFMG AGDIQLYERS FEELLANLTK NTQ
Consensus                   ********** ********** ********** ********** ***
Table 47: Comparative Sequences relating to SAG0739 (strain info highlighted in BOLD)



SEQ ID NO. 4701 
STRAIN A909 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTAAAGAT
TATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGGATATCGTCTC
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG
TCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGCTAAAAAAGAA
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA
AAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4702 
STRAIN H36B

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTAAAGAT
TATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGGATATCGTCTC
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG
TCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGCTAAAAAAGAA
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA
AAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4703 
STRAIN 18RS21 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTAAAGAT
TATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGGATATCGTCTC
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG
TCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGCTAAAAAAGAA
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA
AAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4704 
STRAIN M732 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTAAAGAT
TATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGGATATCGTCTC
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG
TCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGCTAAAAAAGAA
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA
AAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4705 
STRAIN COH1 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTAAAGAT
TATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGGATATCGTCTC
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG
TCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGCTAAAAAAGAA
GATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTGATTATAAAGA
AAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4706 
STRAIN M781 

TATTTTTTAACAACAAAAAAAGGAAAAGAGC
TAAGGAAAAATGCAGAAAAATTCTATGGAGAATATAAAGAAAATCCAGAA
GAATATCATCAAATAGCTAAAGATAAAGCAAGTGAATATTCAAATTTAGC
TGTTGATACTTTTAAAGATTATAAAGGTAAATTTGAATCAGGTGAATTGA
CAACAGAGGATATCGTCTCAGCCGTTAAGGAAAAAAGCGGAGAAGTAGTT
GACTTTGCTAATGATTTTGTCAATCAAGCTAAATCAAAATTCTCAGACGA
GGATACTGCTAAAAAAGAAGATAAGGCTCCTGAAACAAAAGTAGAAGATA
TTGTCATTGATTATAAAGAAAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4707 
STRAIN 2603 

tattttttaacaacaaaaaaaggaaaagagctaaggaaaaatgcagaaaa 
attctatggagaatataaagaaaatccagaagaatatcatcaaatagcta 
aagataaagcaagtgaatattcaaatttagctgttgatacttttaaagat 
tataaaggtaaatttgaatcaggtgaattgacaacagaggatatcgtctc 
agccgttaaggaaaaaagcggagaagtagttgactttgctaatgattttg 
tcaatcaagctaaatcaaaattctcagacgaggatactgctaaaaaagaa 
gataaggctcctgaaacaaaagtagaagatattgtcattgattataaaga
aaacacagaagataaagaaaaa 

SEQ ID NO. 4708 
STRAIN 090 

TATTTTTTaACaACAAAAAAAGGAAAAGAGCTAAGGAAAAATGCAGAAAA
ATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCATCAAATAGCTA
AAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATACTTTTAAAGAT
TATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGGATATCGTCTC
AGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCTAATGATTTTG
TCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGCTAAAAAAGAa
GATAAGGCTCCTGAAACAAAaGTAGAAGATATTGTCATTGATTATAAAGA
AAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4709 
STRAIN CJB110 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAAA
ATGCAGAAAAATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCAT
CAAATAGCTAAAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATAC
TTTTAAAGATTATAAAGGTAAATTTGAATCAGGTgAATTGACAACAGAGG
ATATCGTCTCAGCCGtTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCT
AATGATTTTGTCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGC
TAAAAAAGAAGATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTG
ATTATAAAGAAAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4710 
STRAIN 1169NT 

TATTTTTTAACAACAAAAAAAGGAAAAGAGCTAAGGAAA
AATGCAGAAAAATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCA
TCAAATAGCTAAAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATA
CTTTTAAAGATTATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAG
GATATCGTCTCAGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGC
TAATGATTTTGTCAATCAAGCTAAATCAAAATTCTCAGATGAGGATACTG
CTAAAAAAGAAAATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATT
GATTATAAAGAAAACACAGAAGATAAAGAAAAA

SEQ ID NO. 4711 
STRAIN JM9130013 

TATTTTTTAaCAACAAAAAAAGGAAAAGAGCTAAGGAAAA
ATGCAGAAAAATTCTATGGAGAATATAAAGAAAATCCAGAAGAATATCAT
CAAATAGCTAAAGATAAAGCAAGTGAATATTCAAATTTAGCTGTTGATAC
TTTTAAAGATTATAAAGGTAAATTTGAATCAGGTGAATTGACAACAGAGG
ATATCGTCTCAGCCGTTAAGGAAAAAAGCGGAGAAGTAGTTGACTTTGCT
AATGATTTTGTCAATCAAGCTAAATCAAAATTCTCAGACGAGGATACTGC
TAAAAAAGAAGATAAGGCTCCTGAAACAAAAGTAGAAGATATTGTCATTG
ATTATAAAGAAAACACAGAAGATAAAGAAAAA

PRETTY of: /biotmp/msa68511.2{*} January 22, 2003 05:47



                           1                                                   50
msa68511.2{164_090}        TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_18RS21}     TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_2603}       TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_A909}       TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_CJB110}     TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_COH1}       TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_H36B}       TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_JM9130013}  TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_M732}       TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_M781}       TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
msa68511.2{164_1169NT}     TATTTTTTAA CAACAAAAAA AGGAAAAGAG CTAAGGAAAA ATGCAGAAAA
Consensus                  ********** ********** ********** ********** **********



                           51                                                 100
msa68511.2{164_090}        ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_18RS21}     ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_2603}       ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_A909}       ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_CJB110}     ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_COH1}       ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_H36B}       ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_JM9130013}  ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_M732}       ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_M781}       ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
msa68511.2{164_1169NT}     ATTCTATGGA GAATATAAAG AAAATCCAGA AGAATATCAT CAAATAGCTA
Consensus                  ********** ********** ********** ********** **********



                           101                                                150
msa68511.2{164_090}        AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_18RS21}     AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_2603}       AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_A909}       AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_CJB110}     AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_COH1}       AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_H36B}       AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_JM9130013}  AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_M732}       AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_M781}       AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
msa68511.2{164_1169NT}     AAGATAAAGC AAGTGAATAT TCAAATTTAG CTGTTGATAC TTTTAAAGAT
Consensus                  ********** ********** ********** ********** **********






WO 2004/018646 



PCT/US2003/026827 






                           151                                                200
msa68511.2{164_090}        TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_18RS21}     TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_2603}       TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_A909}       TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_CJB110}     TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_COH1}       TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_H36B}       TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_JM9130013}  TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_M732}       TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_M781}       TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
msa68511.2{164_1169NT}     TATAAAGGTA AATTTGAATC AGGTGAATTG ACAACAGAGG ATATCGTCTC
Consensus                  ********** ********** ********** ********** **********



                           201                                                250
msa68511.2{164_090}        AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_18RS21}     AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_2603}       AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_A909}       AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_CJB110}     AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_COH1}       AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_H36B}       AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_JM9130013}  AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_M732}       AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_M781}       AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
msa68511.2{164_1169NT}     AGCCGTTAAG GAAAAAAGCG GAGAAGTAGT TGACTTTGCT AATGATTTTG
Consensus                  ********** ********** ********** ********** **********

                           251                                                300
msa68511.2{164_090}        TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_18RS21}     TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_2603}       TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_A909}       TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_CJB110}     TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_COH1}       TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_H36B}       TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_JM9130013}  TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_M732}       TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_M781}       TCAATCAAGC TAAATCAAAA TTCTCAGAcG AGGATACTGC TAAAAAAGAA
msa68511.2{164_1169NT}     TCAATCAAGC TAAATCAAAA TTCTCAGAtG AGGATACTGC TAAAAAAGAA
Consensus                  ********** ********** ********_* ********** **********

                           301                                                350
msa68511.2{164_090}        gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_18RS21}     gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_2603}       gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_A909}       gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_CJB110}     gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_COH1}       gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_H36B}       gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_JM9130013}  gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_M732}       gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_M781}       gATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
msa68511.2{164_1169NT}     aATAAGGCTC CTGAAACAAA AGTAGAAGAT ATTGTCATTG ATTATAAAGA
Consensus                  _********* ********** ********** ********** **********



                           351                  372
msa68511.2{164_090}        AAACACAGAA GATAAAGAAA AA
msa68511.2{164_18RS21}     AAACACAGAA GATAAAGAAA AA
msa68511.2{164_2603}       AAACACAGAA GATAAAGAAA AA
msa68511.2{164_A909}       AAACACAGAA GATAAAGAAA AA
msa68511.2{164_CJB110}     AAACACAGAA GATAAAGAAA AA
msa68511.2{164_COH1}       AAACACAGAA GATAAAGAAA AA
msa68511.2{164_H36B}       AAACACAGAA GATAAAGAAA AA
msa68511.2{164_JM9130013}  AAACACAGAA GATAAAGAAA AA
msa68511.2{164_M732}       AAACACAGAA GATAAAGAAA AA
msa68511.2{164_M781}       AAACACAGAA GATAAAGAAA AA
msa68511.2{164_1169NT}     AAACACAGAA GATAAAGAAA AA
Consensus                  ********** ********** **



SEQ ID NO. 4712
STRAIN 2603

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK

SEQ ID NO. 4713 
STRAIN A909 frame: X 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK



SEQ ID NO. 4714 






STRAIN H36B frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK

SEQ ID NO. 4715 
STRAIN 18RS21 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK



SEQ ID NO. 4716 
STRAIN M732 frame: 1 

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK



SEQ ID NO. 4717 
STRAIN COH1 frame: 1

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK

SEQ ID NO. 4718 
STRAIN M781 frame: 1

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK

SEQ ID NO. 4719
STRAIN 090 frame: 1

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK



SEQ ID NO. 4720
STRAIN CJB110 frame: 1

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK



SEQ ID NO. 4721 
STRAIN 1169NT frame: 1

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKENKAPETKVEDIVIDYKENTE
DKEK



SEQ ID NO. 4722
STRAIN JM9130013 frame: 1

YFLTTKKGKELRKNAEKFYGEYKENPEEYHQIAKDKASEYSNLAVDTFKDYKGKFESGEL
TTEDIVSAVKEKSGEVVDFANDFVNQAKSKFSDEDTAKKEDKAPETKVEDIVIDYKENTE
DKEK



PRETTY of: /biotmp/msa68746.2{*} January 22, 2003 05:54



                           1                                                   50
msa68746.2{164_090}        YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_1169NT}     YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_18RS21}     YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_2603}       YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_A909}       YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_CJB110}     YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_COH1}       YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_H36B}       YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_JM9130013}  YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_M732}       YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
msa68746.2{164_M781}       YFLTTKKGKE LRKNAEKFYG EYKENPEEYH QIAKDKASEY SNLAVDTFKD
Consensus                  ********** ********** ********** ********** **********



                           51                                                 100
msa68746.2{164_090}        YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_1169NT}     YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_18RS21}     YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_2603}       YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_A909}       YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_CJB110}     YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_COH1}       YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_H36B}       YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_JM9130013}  YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_M732}       YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
msa68746.2{164_M781}       YKGKFESGEL TTEDIVSAVK EKSGEVVDFA NDFVNQAKSK FSDEDTAKKE
Consensus                  ********** ********** ********** ********** **********



                           101         124
msa68746.2{164_090}        dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_1169NT}     nKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_18RS21}     dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_2603}       dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_A909}       dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_CJB110}     dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_COH1}       dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_H36B}       dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_JM9130013}  dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_M732}       dKAPETKVED IVIDYKENTE DKEK
msa68746.2{164_M781}       dKAPETKVED IVIDYKENTE DKEK
Consensus                  ********** ********** ****






Table 48: Comparative Sequences relating to SAG1474 



SEQ ID NO: 4801 
STRAIN 2603 

aatagtactgagacaagtgcttcagtagttcctactacaaatactatcgt
tcaaactaatgacagtaatcctaccgcaaaatttgtatcagaatcaggac
aatctgtaataggtcaagtaaaaccagataattctgcggcgcttacaaca 
gttgacacgcctcatcatatttcagctccagatgctttaaaaacaactca
atcaagtcctgtcgttgagagtacttctactaagttaactgaagagactt 
acaaacaaaaagatggtcaagatttagccaacatggtgagaagtggtcaa 
gttactagtgaggaactcgttaatatggcatacgatattattgctaaaga 
aaacccatctttaaatgcagtcattactactagacgccaagaagctattg 
aagaggctagaaaacttaaagataccaatcagccgtttttaggtgttccc
ttgttagtcaaggggttagggcacagtattaaaggtggtgaaaccaataa 
tggcttgatctatgcagatggaaaaattagcacatttgacagtagctatg 
tcaaaaaatataaagatttaggatttattattttaggacaaacgaacttt 
ccagagtatgggtggcgtaatataacagattctaaattatacggtctaac 
gcataatccttgggatcttgctcataatgctggtggctcttctggtggaa 
gtgcagcagccattgctagcggaatgacgccaattgctagcggtagtgat 
gctggtggttctatccgtattccatcttcttggacgggcttggtaggttt
aaaaccaacaagaggattggtgagtaatgaaaagccagattcgtatagta 
cagcagttcattttccattaactaagtcatctagagacgcagaaacatta 
ttaacttatctaaagaaaagcgatcaaacgctagtatcagttaatgattt 
aaaatctttaccaattgcttatactttgaaatcaccaatgggaacagaag 
ttagtcaagatgctaaaaacgctattatggacaacgtcacattcttaaga 
aaacaaggattcaaagtaacagagatagacttaccaattgatggtagagc 
attaatgcgtgattattcaaccttggctattggcatgggaggagcttttt 
caacaattgaaaaagacttaaaaaaacatggttttactaaagaagacgtt 
gatcctattacttgggcagttcatgttatttatcaaaattcagataaggc 
tgaacttaagaaatctattatggaagcccaaaaacatatggatgattatc 
gtaaggcaatggagaagcttcacaagcaatttcctattttcttatcgcca 
acgaccgcaagtttagcccctctaaatacagatccatatgtaacagagga 
agataaaagagcgatttataatatggaaaacttgagccaagaagaaagaa 
ttgctctctttaatcgccagtgggagcctatgttgcgtagaacacctttt 
acacaaattgctaatatgacaggactcccagctatcagtatcccgactta 
cttatctgagtctggtttacccatagggacgatgttaatggcaggtgcaa 
actatgatatggtattaattaaatttgcaactttctttgaaaaacatcat 
ggttttaatgttaaatggcaaagaataatagataaagaagtgaaaccatc 
tactggcctaatacagcctactaactccctctttaaagctcattcatcat 
tagtaaatttagaagaaaattcacaagttactcaagtatctatctctaaa 
aaatggatgaaatcgtctgttaaaaataaaccatccgtaatggcatatca 
aaaagca 

SEQ ID NO: 4802 
STRAIN 090 

* AATAGTACTGAGACAAGTG CTT CAGTAGTT CCTACTACAA 

ATACTAT CGTT CAAACT AATGACAGTAAT C CT AC CG CAAAATTTGTAT CA 
GAATCAGGACAATCTGTAAT AGGTCAAGTAAAAC CAGAT AATT CTG CGG C 
GCTTACAACAGTTGACACG C CT CATCATATTTCAG CT CCAGATG CTTTAA 
AAACAACTCAATCAAGT CCTGT CGTTGAGAGTACTT C'l ACTAAGTTAACT 
GAAGAGACTTACAAACAAAAAGATGGTAAAGATTTAG CCAACATGGTGAG 
AAGTGGT(^AAGTTACTAGTGAGGAACTCGTTAATATGGCATACGATATTA 
TTGCTAAAGAAAACC CATCTTT AAATG CAGT CATTACTACTAGACG C CAA 
GAAG CTATTGAAGAGG OTAGAAAACTTAAAGATAC CAAT CAG C CGTTTTT 
AGGTGTT C CCTTGTTAGTCAAGGGGTTAGGG CACAGTATTAAAGGTGGT G 
AAAC CAATAATGGCTTGAT CTATG CAGATGGAAAAATTAG CACATTTGAC 
AGTAGCTATGTCAAAAAATATAAAGATTTAGGATTTATTATTTTAGGACA 
AACGAACTTT CCAGAGTATGGGTGG CGTAATATAACAGATTCTAAATTAT 
ACGGT CTAACGCATAAT CCTTGGGATCTTG CT CATAATG CTGGTGGCT CT 
TCTGGTGGAAGTGCAGCAGCCATTGCTAGCGGAATGACGCCAATTGCTAG 
CXJGTAGTGATGCTGGTGGTTCTATCCGTATTCCATCTTCrTGGACGGGCT 
TGGTAGGTTTAAAAC CAACAAGAGGATTGGTGAGTAATGAAAAG C CAGAT 
TCGTATAGTACAGCAGTT CATTTT C CATTAACTAAGT CAT CT AG AGACG C 
AGAAACATT ATTAACTT AT CTAAAGAAAAG CG AT CAAACGCT AGTAT CAG 
TTAATGATTTAAAAT CTT t AC CAATTG CTT ATACTTTGAAATCAC CAATG 
GGAACAG AAGTTAGT CAAG ATG CTAAAAACG CTATTATGGACAACGT CAC 
ATTCTTAAGAAAACAAGGATTC^U^GTAACAGAGATAGACTTACCAATTG 
ATGGTAGAG CATTAATGCGTGATTATT CAACCTTGG CTATTGG CATGGG A 
GGAGCTTTTTCAACAATTGAAAAAGACTO 

■ AGAAGACGTTGATCCTATTACTTGGG CAGTTCATGTTATTTAT CAAAATT 
CAGATAAGG CTGAACTTAAGAAAT CTATTATGGAAGC CCAAAA^CATATG 
GATGATTATCGT AAGGCAATGGAG AAG CTT CACAAGCA^TTTCCTATTTT 
CTTATCGCCAACGACCGCAAGTTTAGCCCCrCTAAATACAGATCCATATG 
TAACAGAGGAAGATAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAA 
GAAGAAAGAATTGCT CT CTTTAAT CGC CAGTGGGAG CCT ATGTTGCGTAG 
AACAC(TITTTACACAAATTGCTAATATGACAGGACTCCCAGCTA 
TC C CGACTTACTTAT CTGAGTCTGGTTTAC C CATAGG GACGATGTTAATG 
G CAGGTG CAAACTATG ATATGGTATTAATTAAATTTG CAACTTTCTTTG A 
AAAACAT CATGGTTTTAATGTT AAATGG CAAAGAATAAT AGATAAAG AAG 
TGAAACGATCTACTGGCCTAATACAGCCTACTAACTCCC^ 
CATTCAT CATTAGTAAATTT AGAAGAAAATTCACAAGTTACT CAAGT AT C 
TATCT CT AAAAAATGGATGAAATCGT CTGTTAAAAATAAACCATC CGTAA 
TGGCATATCAAAAAGCA 

SEQ ID NO: 4803 
STRAIN A909

TACTACAAATACTATCGTTCAAACTAATGAC^GTAATCCTACCGCAAAAT 

TTGTATCAGAATCAGGACAATCTGTAATAGGTCAAGTAAAACCAGATAAT 

TCTGCGGCGCTTACAACAGTTGACACGCCTCATCATATTTCAGCTCCAGA 

TGCTTTAAAAACAACTCAATCAAGTCCTGTCGTTGAGAGTACTTCTACTA 

AGTTAACTG AAGAG ACTTACAAACAAAAAGATGGT CAAG ATTTAG CCAAC 

ATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGTTAATATGGCATA 

CGATATTATTGCTAAAGAAAACCCATCTTTAAATGCAGTCATTACTACTA 

GACGCCAAGAAGCTATTGAAGAGGCTAGAAAACTTAAAGATACCAATCAG 

CCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGCACAGTATTAA . 

AGGTGGTGAAACCAATAATGGCTTGATCTATGCAGATGGAAAAATTAGCA 

CATTTGACAGTAG CTATGT CAAAAAAT ATAAAGATTT AGGATTTATT AT T 

TTAGGACAAACGAACTTTCCAGAGTATGGGTGGCGTAATATAACAGATTC 

TAAATTATACGGTCTAACGCATAATCCTTGGGATCTTGCTCATAATGCTG 

GTGGCTCTT CTGGTGGAAGTGCAG CAGCCATTGCTAGCGGAATGACG CCA 

ATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTCCATCTTCTTG 

GACGGGCrTTGGTAGGTTTAAAACCAACAAGAGGATTGGTGAGTAATGAAA 

AG CCAGATTCGTATAGTACAGCAGTTCATTTT CCATTAAc TAAGT CAT CT 

AGAGACG CAGAAACATTATT AACTTAT CTAAAGAAAAGC GAT CAAACG CT 

AGTAT CAGTTAATGATTTAAAATCTTT AC CAATTG CTTATACTTTGAAAT 

CACCAATGGGAACAGAAGTTAGTCAAGATG CT AAAAACG CTATT ATGGAC 

AACGT CAC aTTCTT AAGAAAACAAGGATT CAAAGTAACAG AGAT AGACTT 

ACCAATTGATGGTAGAGCATTAATGCGTGATTATTCAACCTTGGCTATTG 

GCATGGGAGGAG CTTTTTCAACAATTGAAAAAGACT^ 

TTTACTAAAGAAGACGTTGATCCTATTACTTGGGCAGOT 

T CAAAATT CAGAT AAGG CTG AACTT AAGAAAT CTATT AT GGAAG C C CAAA 

AACATATGGATGATTAT CGTAAGG CAATGGAGAAG CTT GACAAGCAATTT 

CCTATTTTCTTATCGC CAACGACCG CAAGTTTAG C CC CT CTAAATACAGA 

TCCATATGTaACAGAGGAAGATAAAAGAGCGATTTATAATATGGAAAACT 

TGAG C CAAGAAGAAAGAATTGCTCT CTTTAATCG C CAGTGGGAG C CTATG 

TTGCX^TAGAACACCTTTTAC^CAAATTGCTAATATGACA 

TATCAGTAT C CCGACTTACTTATCTGAGTCTGGTTTACCCATAGGGACGA 

TGTTAATGGCAGGTG CAAACTATGATATGGTATTAAT^ 

TTCTTTGAAAAACATC^TGGTTTTAATGTTAAATGGCAAAGAATAATAGA 

TAAAGAAGTGAAACCATCTACTGGCCTAATACAGCCTACTAACTCCCTCT 

TTAAAGCTCATT CAT CATTAGTAAATTTAGAAGAAAATT CACAAGTTACT 

CAAGTATCTATCTCTAAAAAATGGATGAAATCGTCTGTTAAAAATAAACC 

ATCCGTAATGGCATATCAAAAAGCA 

SEQ ID NO: 4804 
STRAIN COH1 

AATAGTACTGAGACAAGTGCTTC^GTAGCTCCTACTACAAAT 
ACTAT CGTT CAAACTAATGACAGTAAT CCTAC CG CAAAATTTGCATCAGA 
AT CAGGACAATCTGTAATAGGT CAAGTAAAAC CAGCTAATT CTG CGG CG C 
TTACAACAGTTGACACG C CT CATATTT GAG CT CCAGATG CTTTAAAAACA 
ACTCAATCAAGT C CTGTCGTTGAGAGT CCTT CTACTrAAGTTAACTGAAGA 
GACATACAAACAAAAAGATGGTCAAGATTTAGCCAACATGGTGAGAAGTG 
GT CAAGTTACTAGTGAGGAACTCGT CAATATGGCATACGATATT ATCGCT 
AAAGAAAACC CATCTTTAAATG CAGT CATTACTACTAGACG C CAAGAAG C 
C^TTGAAGAGGCTAGAAAACTTAAAGATACTAATCAG C CGTTTTTAGGTG 
TT CC CTTGTT AGT CAAGGGGTTAGGG CACAGTATTAAAGGTGGTGAAAC C 
AATAATGGCTTG AT CTATG CAGATGGAAAAATTAG CACATTTGACAGTAG 
CTATGTCAAAAAATATAAAGATTTAGGATTTATTAT^ 

ATTTTCCAGAGTATGGGTGGCXjTAATATAACAGACrCrAAATTATACGGT 
CCAACGCTVrAATCCTTGGAATCTTGCTCATA 

TGGAAGTGCAG CAG CTATTG CTAGCGGAATGACGC CAATTG CTAGCGGCA 
GTGATGCTGGTGGTT CTAT CCGTATTCCATCTTGTTGGACGGGCTTAGTA 
GG TTTAAAACCAACAAGAGGATTGGTGAGTAATGAAAAGCCAGATTCGTA 
TAGT ACAGCAGTTCATTTT C CATTAACTAAGT CATCTAG AGACG CAG AAA 
CATTGTTAACITACCTAAAGAAAAGCGATCAAACGCTAGTATCAGTTA^ 
GATTTAAAATCTTTAC CAATTG CTTATACTTTGAAAT CAC CAATG GG AAC 
AGAAGTTAGTCAAGATGCTAAAAATGCTATTATGGACAACGTCAGATTOT 
TAAGAAAACAAGGATTCAAAGTGACAGAGATAGATTtACCAATTGATGGT 
AGAG CATTAATG CGTGATTATT CAAC CTTGG CTATTGG GATGGGAGGAG C 
TTTTTCAAGAATTGAAAAAGACTTAAAAAAA 

ACGTTGATC C CATT ACTTGGGCAGTT CATGTTATTTAT CAAAATT CAGAT 
AAGGCTGAACTTAAG AAAT CTATTGTGGAAGCC CAAAAACATATGGATGA 
TTAT CGTAAGGCAATGGAG AAGCTT CACAAGCAATTTCCT CTTAT 
CGCCAACGACCGCAAgTTTAGCCCCTCTAAATAC^ 

GAGAAAGATAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAAGAAGA 
AAGAATTGCTCTCTTTAATCGCCAGTGGGAGCCTATGCT 
CTTTTACACCAATTGCTAATAtGACAGGACTCCCAGCTATCAGTATCCCG 
ACTTACTTAT CTGAGTCTGGTTTAC C CATAGGGACGATGTTAATGGCAGG 
TG CAAACrATGATATGGTATTAATTAAATTTG CAACITTCTTTXIAAAAAC 
AT CATGGTTTTAATGTTAAATGG CAAAG AATAAT AGATAAAGAAGTGAAA 
CCAT CTG CT GAC CTAATACAG CCTACTAACTCCCT CTTTAAAGCTCATT C 
ATCATTAGTAAATTTAGAAG AAAATTCACAAGTT ACT CAAGTATCTATCT 
CTAAAAAATGGATGAAATCGTCTGTTAAAAATAAACCATCCGTAATGGCA 
TATCAAAAAGCA 

SEQ ID NO: 4805
STRAIN M732 

TCAGTAG CT CCT ACT ACAAATACTATCGTT CAAACTAATGACAGTAAT CC 
TACCGCAAAATTTGCATCAGAATCAGGACAATCTGTAATAGGTCAAGTAA
AACCAGCTAATTCTGCGGCGCrTAC^C^GTTGACACGCCTCATATTTCA 
GCTCCAGATGCTTTAAAAACAACTCAATCAAGTCCTGTCGTTGAGAGTCC 
TTCrACTAAGTTAACTGAAGAGACATACAAACAAAAAGATGGTCAAGATT 
TAGCCAACATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGTCAAT 
ATGGCATACGATATTATCGCTAAAGAAAACCCATCTTTAAATGCAGTCAT 
TACTACTAGACGCCAAGAAGCCATTGAAGAGGCTAGAAAACTTAAAGATA 
CTAAT CAG C CGTTTTTAGGTGTTC C CTTGTTAGT CAAGGG GTTAGGG CAC 
AGTATTAAAGGTGGTGAAAC CAATAATGG CTT GATCT ATGCAGATGGAAA 
AATTAGCACATTTGACAGTAGCTATGTCAAAAAATATAAAGATTTAGGAT 
TTATT ATTT TAGGACAAACG AATTTTC CAG AGTATGGGTGG CGTAATATA 
ACAGACTCrAAATTATACGGTCnAACGCATAATCCTTGGGATCTTGCTCA 
TAACGCTGGTGGCTCTTCTGGTGGAAGTGCAGCAGCTATTGCTAGCGGAA 
TGACGCCAATTGCTAGCGGCAGTGATGCTGGTGGTTCTATCCGTATTCCA 
TCTTCTTGGACGGGCTTAGTAGGTTTAAAACCAACAAGAGGATTGGTGAG 
TAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTAACTA 
AGTCATCTAGAGACGCAGAAACATTGTTAACTTACCTAAAGAAAAGCGAT 
CAAACGCTAGTATCAGTTAATGATTTAAAATCTTTACCAATTGCTTATAC 
TTTGAAATCACCAATGGGAACAGAAGTTAGTCAAGATGCTAAAAATGCTA 
TTATGGACAACGTCACATT CTTAAGAAAACAAGGATT CAAAGTGACAGAG 
AT AG ATTTAC CAATTGATGGTAGAG CATTAATGCGTG ATTATTCAACCTT 
GGCTATTGG CATGGG AGGAG CTTT TTCAACAATTGAAAAAGACTTAAAAA 
AACATGGTTTTACTAAAGAAGACGTTGATC CCATTACTTGGG CAGTT CAT 
GTTATTTAT CAAAATT CAG ATAAGG CTGAACTTAAGAAAT CTATTGTGGA 
AGCCGAAAAACATATGG ATGATTATCGTAAGG CAATGGAGAAG CTTCACA 
AG CA ATTTC CTATTTT CTTATCGCCAACGACCGCAAGTTTAG CC C CT CTA 
AATACAGATCCATATGTTACAGAGAAAGATAAAAGAGCGATTTATAATAT 
GGAAAACTTG AG CCAAGAAGAAAGAATTG CTCTCTTTAAT CG CCAGTGGG 
AG CCTATGTTGCGTAGAACACCTTTTACAC CAATTG CTAATATGACAGGA 
CT C C CAGCTATCAGTAT C C CGACTTACTTAT CTGAGTCTGGTTTAC C CAT 
AGGGACGATGTTAATGG CAGGTGCAAACT ATGATATGGTATTAATTAAAT 
TTGCAACTTTCTTTGAAAAACATCATGG 

ATA^TAGATAAAGA^GTGAAACCATCTGCTGACCTAATACAGCCTACTAA 
CT CC CT CTTTAAAG CTCATT CATCATTAGTAAATTTAGAAGAAAATTCAC 
AAGTTACTCAAGTAT CTAT CTCTAAAAAATGG ATG AAATCGT CTGTTAAA 
AATAAACCAT CCGTAATGG CATAT CAAAAAG CA 

SEQ ID NO: 4806
STRAIN 18RS21 

AATAGTACTGAGACAAGTG CTT CAGTAGTTCCTACTACAAATACTAT CGT 
TCAAACTAATGACAGTAAT C CTACCGCAAAATTTGTATCAGAAT CAGGAC 
AATCTGTAAT AGGT CAAGT AAAACCAGATAATTCTG CGG CGCTTACAACA 
GTTG ACACG C CTCAT CATATTTCAG CTCCAGATG CTTTAAAAACAACTCA 
AT CAAGT CCTGT CGTTG AGAGTACTTCTACTAAGTTAACTGAAGAGACTT 
ACAAACAAAAAG ATGGT CAAGATTTAG CCAACATGGTGAG AAGTGGTCAA 
' GTTACTAGTGAGGAACTCGTTAATATGGCATACGATATTATTGCTAAAGA 
AAACC CATCTTTAAATG CAGTCATTACTACTAGACGC CAAGAAG CTATTG 
AAGAGGCTAGAAAACTTAAAGATAC CAAT CAG CCGTTTTTAGGTGTT CC C 
TTGTTAGTCAAGGGGTTAGGGCACAGTATTAAAGGTGGTGAAACCAATAA 
TGGCTTGATCTATG CAGATGGAAAAATTAG CACATTTGACAGTAGCTATG 
T CAAAAAATATAAAG ATTTAGGATTTATTATTTT^ 

CCAGAGTATGGGTGGCGTAATATAACAGATTCTAAATTATACGGTCTAAC 
GCATAATCCTTGGGATCTTG CT CATAATGCTGGTGGCTCTTCTGGTGGAA. 
GTGCAG CAG CCATTG CTAG CGGAATGACG C CAATTG CTAG CGGTAGTGAT 
GCTGGTGGTT CTAT CCGTATTCCAT CTT CTTGGACGGGCTTGGTAGGTTT 
AAAAC CAACAAGAGGATTGGTGAGTAATGAAAAG CCAGATTCGTATAGTA 
CAGCAGTTCATTTTCCATTAACTAAGTCATCTAGAGACG 
TTAACTTAT CTAAAGAAAAGCGAT CAAACG CTAGTATCAGTTAATGATTT 
AAAATCTTTACCAATTGCTTATACTTTGAAATCACCAATGG 
TTAGT CAAGATGCTAAAAAC G CTATTATGGACAACGTCACATT CTTAAGA 
AAACAAGGATTCAAAGTAACAGAGATAGACTTACCAATTGATGGTAGAGC 
ATTAATG CGTGATTATT CAAC CTTGG CTATTGGCATGGGAGG AG CTTTTT 
CAACAATTGAAAAAGACTTAAAAAAACATGGTTTTACTAAAGAAGACG 
GAT C CTATTACTTGGG CAGTT CATGTTATTTATCAAAAT T CAGATAAGG C 
TGAACTTAAGAAAT CrATTATGGAAGC CCAAAAA.CATATGGATGATT AT C 
GTAAGGCAATGGAGAAGCTT CACAAGCAATTTCCTATTT^ 
ACGACCG CAAGTTT AG C CCCT CTAAATACAGATC CATATGTAACAGAGG A 
AG a t AAAAG AGCGATTTATAATATGGAAAACTTGAG C CAAGAAGAAAGAA 
TTGCT CTCTITAAT CG C CAGTGGGAGC CTATGCTGCX3TAGAACAC CTTTT 
ACACAAATTG CTAATATGACAGGACTC CCAGCTATCAGTATCCCG ACTTA 
CTTAT CTGAGTCTGGTTTAC C CATAGGGACGATGTTAATGGCAGGTG CAA 
ACTATGAT ATGGTATTAATTAAATTTG CAACT1T' CTTTGAAAAACAT CAT 
GGTTTTAATGTTAAATGGCAAAGAATAATAGATAAAGAAGTGAAA.CCATC 
TACTGG CCT AATACAGCCTACTAACTC CCT CTTTAAAG CT CATTCAT CAT 
TAGT AAATTTAGAAGAAAATTCACAAGTTACTCAAGT ATCTATCT 
AAATGGATGAAATCGT CTGTTAAAAAT AAACCAT C CGTAATGGCATATCA 
AAAAG CA 

SEQ ID NO: 4807 
STRAIN M781 

TG CTTCAGTAG CT CCTACTACAAATACTATCGTT CAAACTAATG ACAGTA 
ATCCTAC CG CAAAATTTG CATCAGAAT CAGGACAATCTGT AATAGGTCAA 
GTAAAAC CAG CTAATTCTG CGGCG CTT ACAACAGTTGACACG CCT CATAT 
TTCA.GCTCCAGATGCTTTAAAAACAACTCAATCAAGTCCTGTCGTTGAGA
GT C CTT CTACTAAGTT AACTGAAG AG ACATACAAACAAAAAG ATGGT CAA 
GATTTAGCCAACATGGTGAGAAGTGGTCAAGTTACTAGTG AG GAACT CGT 
CAATATGG CATACG ATATT AT CGCTAAAG AAAAC CCATCTTTAAATG CAG 
TCATT ACTACT AG ACG C CAAGAAG C CATTGAAGAGG CTAG AAAACTTAAA 
GATACTAAT CAG CCGTTTTT AGGTGTT CC CTT GTT AGTCAAGGGGTT AG G 
GCACAGTATtAAAGGTGGTGAAACCAATAATGGCTTGATCTATGCAGATG 
GAAAAATTAG CACATTTGACAGTAG CT ATGTCAAAAAAT ATAAAGATTTA 
GGATTTATTATTTTAGGACAAACGaATTTTCCAGAGTATGGGTGGCGTAA 
TATAACAGACTCTAA^TTATACGGTCCAACGCATAATCCTTGGAaTCTTG 
CTCATAACGCTGGTGGCTCTTCTGGTGGAAGTGCAGCAGCTATTGCTAGC 
GGAATGACGCCAATTGCTAGCGGCAGTGATGCTGGTGGTTCTATCCGTAT 
TCCATCTTCTTGGACGGGCTTAGTAGGTTTAA\A.CCAA 
TGAGTAATGAAAAGCCAGATTCGTATAGTAC^GCAGTTCATTTTCCATTA 
ACTAAGT CAT CTAG AGACG CAG AAACATTGTTAACTTAC CTAAAGAAAAG 
CGAT CAAACGCTAGTATCAGTTAATGATTTAAAaTCTTT^ 
ATACTTTGAAAT CAC CAATGGG AACAG AAgTTAGTCAAG ATG CTAAAAAT 
GCTATTATGG ACAACGT CACATT CTTAAGAGAACAAGGATTCAAAGTGAC 
AGAG ATAGATTTAC CAATTG ATGGTAG AG CATTAATG CGTGATTATTCAA 
CCTTGGCTATTGGCATGGGAGGAGCTTTTTCAACAATT 
AAAAAACATGGTTTTACTAAAGAAGACGTTGATCCCATTACTTGGGCAGT 
TCATGTTATTTAT CAAAATT CAGATAAGG CTGAACTTAAGAAAT CTATTG 
TGGAAGCCCAAAAACATATGGATGATT AT CGTAAGG CAATGGAGAAG CTT 
CACAAGCAATTT C CTAT^ITT CTTATCG CCAACGACCG CAAGTTTAG C CC C 
TCTAAAT ACAGAT C CATATGTAACAGaG aAAGATAAAAGAGCGATTTATA 
ATATGGAAAACTTG AGC CAAGAAGAAAGAATTG CTCT CTTT 
TGGGAGC CTATGTTG CGTAGAACAC CTTTT ACAC CAATTG CTAATAt GAC 
AGGACTCCCAGCTATCAGTATCCCGACnTACTTATCTGAGTCTGG 
CGATAGGGACGATGTTAATGG CAGGTG CAAACTATGATATGGTATTAATT 
AAATTTG CAACTTT CTTTGAAAAACAT CATGGTTTTAATGTTAAATGGCA 
AAGAATAATAGATAAAGAAGTGAAAC CAT CTG CTG AC CTAATACAGCCTA 
CTAACTC CCT CTTTAAAGCT CATT CATCATTAGTAAATTT AGAAGAAAAT 
T CACAAGTTACTCAAGTAT CTATCT CTAAAAAATGGATGAAATCGTCTGT 
TAAAAATAAACCAT CCGTAATGG CATAT CAAAAAGCA 

SEQ ID NO: 4810
STRAIN CJB110 

TAGTT CCTACTACAAAT ACTAT CGTT CAAACTAATGACAGTAATCCTAC C 
GCAAAATTTGTATCAGAAT CAGGACAATCTGTAATAGGTCAAGTAAAAC C 
AGATAATTCTG CGGCGCTTACAACAGTTGACACG CCT CAT CATATTTCAG 
CTCCAGATG CTTTAAAAA.CAACTCAATCAAGTC CTGT CGTTGAGAGTACT 
TCTACTAAGTTAACTGAAGAGACTTACAAACAAA 

AGCCAACATGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGTTAATA 
TGGCATACGATATTATTGCTAAAGAAAACCCATLTTT'AAATGCAGTCATT 
ACTACTAGACG C CAAGAAG CTATTGAAGAGGCTAGAAAACITAAAGATAC 
CAATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGCACA 
GTATTAAAGGTGGTGAAAC CAATAATGG CTTGATCTATG CAGATGGAAAA 
ATTAGCACATTTGACAGTAG CTATGTCAAAAAATATAAAG ATTTAGGATT 
TATTATTTT AGGACAAACGAACTTT C CAGAGTATGGGTGGCGTAATATAA 
CAGATTCTAAATTATACGGT CTAACG CATAATC CTTGGGATCTTGCTCAT 
AATGCTGGTGG CTCTT CTGGTGGAAGTG CAGCAG CCATTG CTAGCGGAAT 
GACGCCAATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTCCAT 
CTTCTTGGACGGG CTTGGTAGGTTTAAAACCAACAAGAGG ATTGGTGAGT 
CATGAAAAG C CAGATTCGTATAGTACAGCAGTTCATTTTCCATTAACTAA 
GTCATCTAGAGACGCAGAAACATTATTAACTTATCTAAAGAAAAGCGATC 
AAACG CT AGT AT CAGTTAATGATTTAAAATCTTTACCAATTGCTTATACr 
TTGAAAT CAC CAAT GGGAACAG AAGTTAGT CAAGATG CTAAAAACG CTAT 
TATGGACAACGT CACATT CTT AAGAAAACAAGGATTCAAAGTAACAGAGA 
TAGACTTACCAATTGATGGTAGAGCATTAATGCGTGATTATTCAACCTTG 
GCTATTGGCATGGG AgGAG CTTTTT CAACaATTGAAAAAG AcTTAaAAAA 
AcATGGTTTTACTAAAGAAGACGTTGATCCTATTACTTGGGCAGT^ 
TTATTTATCAAAATTCAGATAAGG CTGAACTTAAG AAATCTATTATGGAA 
G CCCAAAAACATATGGATGATTAT CGTAAGG CAATGGAGAAG CTTCACAA 
GCAATTTCCTATTTTCTTATCGCCAACGACCGCAAG^ 
ATACAGATCCATATGTAACAGAGGAAGATAAAAGAGCGATTTATAATATG 
GAAi^ACTTGAG C CAAGAAG AAAGAATTGCT CT CTTTAATCGCCAGTGGGA 
GCCTATGTTGCGTAGAAC^CCTTTTACACAAATTGCTAATAtGACAG 
TCCCAGCTATCAGTATCCCGACTTACTTATCTGAGTCTGGTTTACCCATA 
gGGACgATGTTAATGGCAGGTGCLAAACTATGATATGGTATTAATTAAATT 
TG CAACLTi' C1TTG AAAAACATCATGGTTTTAATGTTAAATGGCAAAGAA 
TAATAGATAAAGAAGTGAAACCAT CTACTGGCCTAATACAGC CTACTAAC 
TCC CT CITT AAAG CTCATTCATCATTAGTAAATTTAGAAGAA^ 
AGTTACTCAAGTATCrATCTCTAAAAAATGGATGAAATCGTCTGTTAAAA 
ATAAACCATCCGTAATGGCATATCAAAAAGCA 

SEQ ID NO: 4811 
STRAIN 1169NT 

AATAGTACTG AGACAAGTG CTT CAGTAG CT CCTACTACAAATACTATCGT 
TCAAACT AATGACAGTAAT C CTAC CGCAAAATTTG CATCAGAAT CAGGAC 
AATCTGTAATATGTCAAGTAAAACCAGATAATTCTGCGGCGCTTACAACA 
GTTGACACGC CT CATATTT CAG CTCCAGAT GATTTAAAAACAACT CAATC 
AAGTCCTGTCGTTGAGAGTACTTCTACTAAGTTAACTGA^ 
AACAAAAAG ATGGTCAAGATTTAG CCAACATGGTGAG AAGTGGT CAAGTT 
ACTAGTGAGGAACTCGTCAATATGGCATACGATATTATTGCTTAAGAAAA
CC CTT CTTTAAATG CAGTCATT ACT ACTAG ACGC CAAGAAGC CATTGAAG 
AGGCrAGAAAACTTAAAGATACTAATCAGCCATTTTTAGGTGTTCCCTTG 
TTAGTCAAGGGGTTAGGGCACAGTATTAAAGGTGGTGAAACCAATAATGG 
CTTGATCTATGCAGATGGAAAAAT t aG CACATTTGACAGTAG CTATGTCA 
AAAAATATAAAGATTTAGGATTTATTATTTTAGGACAAACGAACTTTCCA 
GAGTATGGGTGGCGTAATATAACAGATTCTAAATTATACGGTCCAACGCA 
TAAC C CT CGGAAT CTTG CT CATAATG CTGGTGG CT CTTCTGGTGG AAGTG 
CAGCAGCCATTGCTAGCGGrATGACGCCAATTGCTAGCGGTAGTGATGCT 
GGTGGTTCTATCCG t ATTCCATCTTCTTGGACGGG CTTGGTAGGTTTAAA 
ACCAACAAGAGGATTGGTGAGTAATGAAAAGCCAGATTCGTATAGTACAG 
CAGTT CATT TTC CATTAACTAAGT CAT CT AGAG ACGCAG AAACATTATT A 
ACTTATCTAAAG AAAAG CGATCAAACG CT AGT AT CAGTT AATGATTTAAA 
ATCTTTACCAATTGCTTATACTTTGAAATCACCAATC4GGAACAGAAGTTA 
GTCMGATGCTAAAAACGCTATTATGGACAACGTCACATTCTTAAGAAAA 
CAAGGATTCAAAGTAACAGAGATAGACTTAC CAATTG ATGGT AGAG CATT 
AATG CGTGATTATTCAACCTTGGCTATTGG CATGGGAGGAGCTTTTT CAA 
CAATTGAAAAAGACTTAAAAAAACATGGTTTTACTAAAGAAGACGTTGAT 
CCTATTACITGGGCAGTTCATGTTATTTAT 

ACITAAGAAATCTATTATGGA^G CC CAAAAA.CATATGGATGATTATCGTA 
AGGCAATGG AGAAG CITCACAAGCAATTT CCTAT TTTCT CAACG 
ACCG CAAGTTTAGC C CCTCTAAAT ACAGA t CCATATGTAACAGAGGAAGA 
TAAAAGAGCGATTTATAATATGGAAAACTTGAGCCAAGAAGAAAGAATTG 
CT CTCTTTAATCG C CAGTGGGAG C CTATGTTG CGTAG AACACCTTTTACA 
CAAATTGCTAATATGACAGG ACTC CCAG CT AT CAGTAT CC CG ACTTACTT 
ATCTGAGTCTGGTTTACCCATAGGGACGATGTTAATGGCAGGTGCAAA.CT 
ATGATATGGTATTAATT AAATTTG CAACTTT CTTTGAAAAACAT CATGGT 
TTTAATGTTAAATGG CAAAGAATAATAGAT AAAGAAGTGAAACCAT CTAC 
TGGCCTAATACAGCCTACTAACTC C CTCTTTAAAG CT CATTCATCATTAG 
TAAATTTAGAAGAAAATTCACAAGTTACTCAAGTATCT^ 

TGGATGAAATCGTCTGTTAAAAATAAACCATCCGTAATGGCATATCAAAA 
AGCA 

SEQ ID NO: 4812
STRAIN JM9130013 

TTCAGTAGCT CCTACTACAAATACT ATCGTTCAAACT AATGACAGTAAT C 
CTACCGCAAAATTTT CAT CAGAATCAGGACAAT CTGTAATAGGTCAAGTA 
AAACC^GCTAATTCTGTGGCGCTTACAACAGTTGACACGCCTCATATTT 
AGCT CCAGATGCTTT AAAAACAACT CAATCAAGTC CTGTCGTTGAGAGT C 
CTTCrACTAAGTTAACTGAAGAGACATA(^AACAAAAAGA 
TTAG C CAACATGGTGAGAAGTGGT CAAGTTACTAGTGAGGAACT CGTCAA 
T ATGG CATACGATATTATTG CTAAAGAAAACC CAT CTTTAAATG CAGTCA 
TTACTACTAGAC GCCAAGAAGCTATTGAAGAGG CTAGAAAA.CTT AAAGAT 
ACCAATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGG 
CAGTATTAAAGGTGGTGAAACCAATAATGGCTTGATCTATGCAGGTGGAA 
AAATTAG CACATTTGACAGTAG CTATGT CAAAAAATATAAAGATTTAGGA 
TTTATTATTTTAGGACAAACCtAAC TTT CCAGAGTATG^ 
AACAGATTCTAAATTATACGGTC CAACGCATAAC C CTTGGAATCTTGCT C 
ATAATGCTGGTGGCT CTTCTGGTGGAAGTG CAG CAGTTATTGCTAG CGGG 
ATGACGCCAATTG CTAG CGGTAGTGATGCTGGTGGTT CTATC CGTATTC C 
ATCTT CTTGG ACGGG CTTGGTAGGTTTAAAAC CAACAAGAGGATTGGTGA 
GTAATGAAAAGCC^GATTCGTATAGTACAGC^GTTCATTTTCC^TTA^CT 
AAGT CATCTAGAGACGCAGAAACATTATTAACTTATCT CGA 
TCAAACG CT AGT AT CAGTT AATGATTT AAAAT CTTTAC CAATTG CTTATA 
CTTTGAAAT CAC CAATGGGAACAGAAGTTAGT CAAGATG CTAAAAATGCT 
ATTATGGACAACGT CATATT CTTAAGAAAACAAGG ATTCAAAGTGACAGA 
GATAG ACTT ACCAATTGATGGTAGAG CATTAATG CGTGATTATTCAACCT 
TGGCTATTGGTATGGGAGGAGCTTTTTCAAC^UOT 

AAACATGGTTTTACTAAAGAAGACGTTGAT CCCATT^ CA 

TGTTATTTATCAAA^TTCAGATAAGGCTGAACTTAAGAA^TCTATTATGG 

AAGCCCAAAAACATATGGATGATTATCGTAAGGCAATGGAGAAGCTTCAC 

AAGCAATTTCCTATTTTCTTATCGCCAACGACCGCAAGTTTAGCCCCT 

AAATAC^GATCCATATGTAACAGAGGAAGATAAAAGAGCGATTTATAATA 

TGGAAAACTTGAGCCAAGAAGAAAGAATTGCTCTCTTTAAT 

GAG C CTATGTTG CGTAG AACAC CTTTTACACAAATTG CTAATATG ACAG G 

ACTCCCAGCTATCAGTATCCCGACTTACT 

TAGGGACGATGTTAATGGCAGGTGCAAACTATGATATGGTATTAATTAAA 
TTTGCAACTTT'CTTT GAAAAATAT CATGGTTTTAATGTTAAATGG CAAAG 
AATAATAGATAAAGAAGTGAAACCATCTACTGGCCTAATACAG C CTACTA 
ACTCCCTCTTTAAAG CT CATTCAT CATTAGTAAATTTAGAAGAAAATTCA 
CAAGTTACT CAAGTATCTAT CT CTAAAAAATGGATGAAAT CGTCTGTT AA 
AAATAAACCATC CGTAATGG CATAT 

SEQ ID NO: 4813
STRAIN H36B 

CTT CAGTAGTT CCTACTACAAATACTAT CGTT CAAACTAATGACAGT AAT 
CCTACCGCAAAATTTT CAT CAGAAT CAGGACAAT CTGTAATAGGT CAAGT 
AAAACCAGCTAATTCTGTGGCGCTTACAACAGTTGACACGCCTCATA 
CAGCTCCAGATG CITTAAAAACAACTCAAT CAAGTCCTGT CGTTG AG AGT 
CCTT CTACTAAGTTAACTG AAGAGACAT ACAAACAAAAAGATGGTCAAG A 
TTTAGCCAA(^TGGTGAGAAGTGGTCAAGTTACTAGTGAGGAACTCGTCA 
ATATGGCATaCX?ATAtTATTGCTAAAGAAAACCCATCTTTAAATG CAGTC 
ATTACTACTAGACG C CAAGAAG CT ATTGAAGAGG CTAGAAAACTTAAAGA 






TACCAATCAGCCGTTTTTAGGTGTTCCCTTGTTAGTCAAGGGGTTAGGGC 
ACAGTATTAAAGGTGGTGAAACCAATAATGGCTTGATCTATGCAGGTGGA 
AAAATTAGCACATTTGACAGTAGCTATGTCAAAAAATATAAAGATTTAGG 
ATTTATTATTTTAGGACAAACGAACTTTCC^GAGTATG^ATGGCGCAATA 
TAACAGATTCTAAATTATACGGTCCAACGCATAACCCTTGGAATCTTGCT 
CATAATGCTGGTGGCTCTTCTGGTGGAAGTGCAGCAGTTATTGCTAGCGG 
GATGACX3CCAATTGCTAGCGGTAGTGATGCTGGTGGTTCTATCCGTATTC 
CATCTTCTTGGACGGGCTTGGTAGGTTTAAAACCAACAAGAGGATTGGTG 
AGTAATGAAAAGCCAGATTCGTATAGTACAGCAGTTCATTTTCCATTAAC 
TAAGT CATCTAG AGACG CAG AAACATTATT AACTTATCT AAAGAAAAGCG 
AT CAAACG CTAGTAT CAGTT AATG ATTTAAAATCT TT ACCAATTGCTTAT 
ACITT GAAATCACCAATGGGAA.CAG AAGTTAGT CAAG ATG CTAAAAATG C 
TATTATGGACAACGTCATATTCTTAAGAAAACAAGGATTCAAAGTGACAG 
AG ATAGACTTAC CAATTGATGGTAGAGCATTAATG CGTGATTATTCAAC C 
TTGGCTATTGGTATGGGAGGAGCTTTTTCAACAATTGAAAAAGACTTAAA 
AAAACATGGTTTTACTAAAGAAGACGTTGATCCCATTACTTGGGCAGTTC 
ATGTTATTTATCAAAATT CAGATAAGG CTG AACTTAAGAAATCTATTATG 
GAAG CC CAAAAACAT ATGGATG ATTAT CGTAAGG CAATGGAGAAGCTTCA 
CAAGCAATTTCCTATTTTOTTATCGCCAACGACCGCAAGTTTAGCCCCTC 
TAAATACAGATC CATATGTAACAG AGGAAG ATAAAAGAG CGATTTATAAT 
ATGGAAAACITOAGCCAAGAAGAAAGAATTGCTCTCTTTAATCGCCAGTG 
GGAG C CTATGTTGCGTAGAACACCTTTTACAGAAATTGCTAATATGACAG 
GAOTCCCAGCrATCAGTATCGCGACTTACTTATCrGAGTCTGGTTTACCC 
ATAGGGACG ATGTT AATGG CAGGTG CAAACTATG ATATGGTATTAATTAA 
ATTTGCAACTTTCTTTGAAA^TATCATGGlUU'l'AATGTTAAATGGCAAA 
GAATAATAGATAAAGAAGTGAAACCATCTACTGGCCTAATACAGCCTACT 
AACT C CCTCTTTAAAGCTCATTCAT CATTAGTAAATTTAGAAGAAAATTC 
ACAAGTTACTCAAGTAT CTATCTCTAAAAAATGG ATGAAATCGT CTGTTA 
AAAATAAA 

PRETTY of: /biotmp/msa71927.2{*} January 22, 2003 07:23



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



1 

aatagtactg agacaagtgc ttcagtagtt ccTACTACAA 

aatagtactg agacaagtgc ttcagtagtt ccTACTACAA 

— TACTACAA 

aatagtactg agacaagtgc ttcagtagtt ccTACTACAA 

, tagtt ccTACTACAA 

aatagtactg agacaagtgc ttcagtagct ccTACTACAA 

tgc ttcagtagct ccTACTACAA 

-tcagtagct ccTACTACAA 

c ttcagtagtt ccTACTACAA 

ttcagtagct ccTACTACAA 

aatagtactg agacaagtgc ttcagtagct ccTACTACAA 
_ _ __.*■******* 



50 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 

ATACTATCGT 
********** 



                           51                                                 100
msa71927.2{173_18RS21}     TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC
msa71927.2{173_2603}       TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC
msa71927.2{173_A909}       TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC
msa71927.2{173_090}        TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC
msa71927.2{173_CJB110}     TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgtATCA GAATCAGGAC
msa71927.2{173_COH1}       TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgcATCA GAATCAGGAC
msa71927.2{173_M781}       TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgcATCA GAATCAGGAC
msa71927.2{173_M732}       TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgcATCA GAATCAGGAC
msa71927.2{173_H36B}       TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTtcATCA GAATCAGGAC
msa71927.2{173_JM9130013}  TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTtcATCA GAATCAGGAC
msa71927.2{173_1169NT}     TCAAACTAAT GACAGTAATC CTACCGCAAA ATTTgcATCA GAATCAGGAC
Consensus                  ********** ********** ********** ****__**** **********



                           101                                                150
msa71927.2{173_18RS21}     AATCTGTAAT AgGTCAAGTA AAACCAGaTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_2603}       AATCTGTAAT AgGTCAAGTA AAACCAGaTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_A909}       AATCTGTAAT AgGTCAAGTA AAACCAGaTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_090}        AATCTGTAAT AgGTCAAGTA AAACCAGaTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_CJB110}     AATCTGTAAT AgGTCAAGTA AAACCAGaTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_COH1}       AATCTGTAAT AgGTCAAGTA AAACCAGcTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_M781}       AATCTGTAAT AgGTCAAGTA AAACCAGcTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_M732}       AATCTGTAAT AgGTCAAGTA AAACCAGcTA ATTCTGcGGC GCTTACAACA
msa71927.2{173_H36B}       AATCTGTAAT AgGTCAAGTA AAACCAGcTA ATTCTGtGGC GCTTACAACA
msa71927.2{173_JM9130013}  AATCTGTAAT AgGTCAAGTA AAACCAGcTA ATTCTGtGGC GCTTACAACA
msa71927.2{173_1169NT}     AATCTGTAAT AtGTCAAGTA AAACCAGaTA ATTCTGcGGC GCTTACAACA
Consensus                  ********** *_******** *******_** ******_*** **********



                           151                                                200
msa71927.2{173_18RS21}     GTTGACACGC CtcaTCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_2603}       GTTGACACGC CtcaTCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_A909}       GTTGACACGC CtcaTCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_090}        GTTGACACGC CtcaTCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_CJB110}     GTTGACACGC CtcaTCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_COH1}       GTTGACACGC C...TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_M781}       GTTGACACGC C...TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_M732}       GTTGACACGC C...TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_H36B}       GTTGACACGC C...TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_JM9130013}  GTTGACACGC C...TCATAT TTCAGCTCCA GATGcTTTAA AAACAACTCA
msa71927.2{173_1169NT}     GTTGACACGC C...TCATAT TTCAGCTCCA GATGaTTTAA AAACAACTCA
Consensus                  ********** *___****** ********** ****_***** **********



                           201                                                250
msa71927.2{173_18RS21}     ATCAAGTCCT GTCGTTGAGA GTaCTTCTAC TAAGTTAACT GAAGAGACtT
msa71927.2{173_2603}       ATCAAGTCCT GTCGTTGAGA GTaCTTCTAC TAAGTTAACT GAAGAGACtT
msa71927.2{173_A909}       ATCAAGTCCT GTCGTTGAGA GTaCTTCTAC TAAGTTAACT GAAGAGACtT
msa71927.2{173_090}        ATCAAGTCCT GTCGTTGAGA GTaCTTCTAC TAAGTTAACT GAAGAGACtT
msa71927.2{173_CJB110}     ATCAAGTCCT GTCGTTGAGA GTaCTTCTAC TAAGTTAACT GAAGAGACtT
msa71927.2{173_COH1}       ATCAAGTCCT GTCGTTGAGA GTcCTTCTAC TAAGTTAACT GAAGAGACaT
msa71927.2{173_M781}       ATCAAGTCCT GTCGTTGAGA GTcCTTCTAC TAAGTTAACT GAAGAGACaT
msa71927.2{173_M732}       ATCAAGTCCT GTCGTTGAGA GTcCTTCTAC TAAGTTAACT GAAGAGACaT
msa71927.2{173_H36B}       ATCAAGTCCT GTCGTTGAGA GTcCTTCTAC TAAGTTAACT GAAGAGACaT
msa71927.2{173_JM9130013}  ATCAAGTCCT GTCGTTGAGA GTcCTTCTAC TAAGTTAACT GAAGAGACaT
msa71927.2{173_1169NT}     ATCAAGTCCT GTCGTTGAGA GTaCTTCTAC TAAGTTAACT GAAGAGACaT
Consensus                  ********** ********** **_******* ********** ********_*



                           251                                                300
msa71927.2{173_18RS21}     ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_2603}       ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_A909}       ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_090}        ACAAACAAAA AGATGGTaAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_CJB110}     ACAAACAAAA AGATGGTaAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_COH1}       ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_M781}       ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_M732}       ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_H36B}       ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_JM9130013}  ACAAACAAAA AGATGGTcAA GAgTTAGCCA ACATGGTGAG AAGTGGTCAA
msa71927.2{173_1169NT}     ACAAACAAAA AGATGGTcAA GAtTTAGCCA ACATGGTGAG AAGTGGTCAA
Consensus                  ********** *******_** **_******* ********** **********

                           301                                                350
msa71927.2{173_18RS21}     GTTACTAGTG AGGAACTCGT tAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_2603}       GTTACTAGTG AGGAACTCGT tAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_A909}       GTTACTAGTG AGGAACTCGT tAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_090}        GTTACTAGTG AGGAACTCGT tAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_CJB110}     GTTACTAGTG AGGAACTCGT tAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_COH1}       GTTACTAGTG AGGAACTCGT cAATATGGCA TACGATATTA TcGCTAAAGA
msa71927.2{173_M781}       GTTACTAGTG AGGAACTCGT cAATATGGCA TACGATATTA TcGCTAAAGA
msa71927.2{173_M732}       GTTACTAGTG AGGAACTCGT cAATATGGCA TACGATATTA TcGCTAAAGA
msa71927.2{173_H36B}       GTTACTAGTG AGGAACTCGT cAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_JM9130013}  GTTACTAGTG AGGAACTCGT cAATATGGCA TACGATATTA TtGCTAAAGA
msa71927.2{173_1169NT}     GTTACTAGTG AGGAACTCGT cAATATGGCA TACGATATTA TtGCTAAAGA
Consensus                  ********** ********** _********* ********** *_********



                           351                                                400
msa71927.2{173_18RS21}     AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_2603}       AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_A909}       AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_090}        AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_CJB110}     AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_COH1}       AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG
msa71927.2{173_M781}       AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG
msa71927.2{173_M732}       AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG
msa71927.2{173_H36B}       AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_JM9130013}  AAACCCaTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCtATTG
msa71927.2{173_1169NT}     AAACCCtTCT TTAAATGCAG TCATTACTAC TAGACGCCAA GAAGCcATTG
Consensus                  ******_*** ********** ********** ********** *****_****



                           401                                                450
msa71927.2{173_18RS21}     AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_2603}       AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_A909}       AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_090}        AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_CJB110}     AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_COH1}       AAGAGGCTAG AAAACTTAAA GATACtAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_M781}       AAGAGGCTAG AAAACTTAAA GATACtAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_M732}       AAGAGGCTAG AAAACTTAAA GATACtAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_H36B}       AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_JM9130013}  AAGAGGCTAG AAAACTTAAA GATACcAATC AGCCgTTTTT AGGTGTTCCC
msa71927.2{173_1169NT}     AAGAGGCTAG AAAACTTAAA GATACtAATC AGCCaTTTTT AGGTGTTCCC
Consensus                  ********** ********** *****_**** ****_***** **********



                           451                                                500
msa71927.2{173_18RS21}     TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_2603}       TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_A909}       TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_090}        TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_CJB110}     TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_COH1}       TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_M781}       TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_M732}       TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_H36B}       TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_JM9130013}  TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
msa71927.2{173_1169NT}     TTGTTAGTCA AGGGGTTAGG GCACAGTATT AAAGGTGGTG AAACCAATAA
Consensus                  ********** ********** ********** ********** **********



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



501 

TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 
TGGCTTGATC 



TATGCAGaTG 
TATGCAGaTG 
TATGCAGaTG 
TATGCAGaTG 
TATGCAGaTG 
TATGCAGaTG 
TATGCAGaTG 
TATGCAGaTG 
TATGCAGgTG 
TATGCAGgTG 
TATGCAGaTG 



GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 
GAAAAATTAG 



********** *******_** ********** 



CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 
CACATTTGAC 



550 

AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 
AGTAGCTATG 



********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



551 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 

TCAAAAAATA 
********** 



TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 
TAAAGATTTA 



GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 
GGATTTATTA 



TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 
TTTTAGGACA 



600 

AACGAAcTTT 
AACGAAcTTT 
AACGAAcTTT 
AACGAAcTTT 
AACGAAcTTT 
AACGAAtTTT 
AACGAAtTTT 
AACGAAtTTT 
AACGAAcTTT 
AACGAACTTT 
AACGAAcTTT 



********** ********** ********** ******_*** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



601 

CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 
CCAGAGTATG 



GgTGGCGtAA 

GgTGGCGtAA 

GgTGGCGtAA 

GgTGGCGtAA 

GgTGGCGtAA 

GgTGGCGtAA 

GgTGGCGtAA 

GgTGGCGtAA 

GaTGGCGcAA 

GaTGGCGcAA 

GgTGGCGtAA 
********** *_*****_**



TATAACAGAt 

TATAACAGAt 

TATAACAGAt 

TATAACAGAt 

TATAACAGAt 

TATAACAGAc 

TATAACAGAc 

TATAACAGAc 

TATAACAGAt 

TATAACAGAt 

TATAACAGAt 
*********_ 



TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 
TCTAAATTAT 



650 

ACGGTCtAAC 
ACGGTCtAAC
ACGGTCtAAC
ACGGTCtAAC
ACGGTCtAAC
ACGGTCcAAC
ACGGTCcAAC 
ACGGTCnAAC 
ACGGTCcAAC 
ACGGTCcAAC 
ACGGTCcAAC 



********** ******_*** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



651 

GCATAAtCCT
GCATAAtCCT
GCATAAtCCT
GCATAAtCCT
GCATAAtCCT
GCATAAtCCT
GCATAAtCCT
GCATAAtCCT
GCATAAcCCT
GCATAAcCCT
GCATAAcCCT
******_***



tGGgATCTTG 

tGGgATCTTG 

tGGgATCTTG 

tGGgATCTTG 

tGGgATCTTG 

tGGaATCTTG 

tGGaATCTTG 

tGGgATCTTG 

tGGaATCTTG 

tGGaATCTTG 

cGGaATCTTG 
_**_******



CTCATAAtGC 
CTCATAAtGC
CTCATAAtGC 
CTCATAAtGC 
CTCATAAtGC 
CTCATAAcGC 
CTCATAAcGC 
CTCATAAcGC 
CTCATAAtGC 
CTCATAAtGC 
CTCATAAtGC 



TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 
TGGTGGCTCT 



700 

TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 
TCTGGTGGAA 



*******_** ********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}



701 

GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGc 
GTGCAGCAGt 
GTGCAGCAGt 
GTGCAGCAGc 



cATTGCTAGC 
cATTGCTAGC 
CATTGCTAGC 
cATTGCTAGC 
CATTGCTAGC 
tATTGCTAGC 
tATTGCTAGC 
tATTGCTAGC 
tATTGCTAGC 
tATTGCTAGC 
CATTGCTAGC 



*********_ _********* 



751 

GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 



CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 



GGaATGACGC 

GGaATGACGC 

GGaATGACGC 

GGaATGACGC 

GGaATGACGC 

GGaATGACGC 

GGaATGACGC 

GGaATGACGC 

GGgATGACGC 

GGgATGACGC 

GGrATGACGC 
**_******* 



TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 



CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 
CAATTGCTAG 



750 

CGGtAGTGAT 
CGGtAGTGAT 
CGGtAGTGAT 
CGGtAGTGAT 
CGGtAGTGAT 
CGGcAGTGAT 
CGGcAGTGAT 
CGGcAGTGAT 
CGGtAGTGAT 
CGGtAGTGAT 
CGGtAGTGAT 



********** ***_****** 



TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 



800 

TgGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 
TaGTAGGTTT 






WO 2004/018646 



PCT/US2003/026827 






msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 
GCTGGTGGTT 



CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 
CTATCCGTAT 



TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 
TCCATCTTCT 



TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 
TGGACGGGCT 



TaGTAGGTTT 
TaGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 
TgGTAGGTTT 



********** ********** ********** ********** *_******** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



801 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 

AAAACCAACA 
********** 

851 

CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
CAGCAGTTCA 
********** 



AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 
AGAGGATTGG 



TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTCATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 
TGAGTaATGA 



********** *****_**** 



TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 

TTTTCCATTA 
********** 



ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 

ACTAAGTCAT 
********** 



850 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 

TCGTATAGTA 
********** 

900 

AGAAACATTa 

AGAAACATTa 

AGAAACATTa 

AGAAACATTa 

AGAAACATTa 

AGAAACATTg 

AGAAACATTg 

AGAAACATTg 

AGAAACATTa 

AGAAACATTa 

AGAAACATTa 
********** *********_ 



AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 

AAAGCCAGAT 
********** 



CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 
CTAGAGACGC 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



901 

TTAACTTAtC 
TTAACTTAtC 
TTAACTTAtC 
TTAACTTAtC 
TTAACTTAtC 
TTAACTTAcC 
TTAACTTAcC 
TTAACTTAcC 
TTAACTTAtC 
TTAACTTAtC 
TTAACTTAtC 



TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 
TAAAGAAAAG 



CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 
CGATCAAACG 



CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 
CTAGTATCAG 



********_* ********** ********** ********** 



950 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 

TTAATGATTT 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



951 

AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 
AAAATCTTTA 



CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 
CCAATTGCTT 



ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 
ATACTTTGAA 



********** ********** ********** 



ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 

ATCACCAATG 
********** 



1000 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 

GGAACAGAAG 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1001 

TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 
TTAGTCAAGA 



TGCTAAAAAc 
TGCTAAAAAc 
TGCTAAAAAc 
TGCTAAAAAC 
TGCTAAAAAC 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAt 
TGCTAAAAAc 



GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 
GCTATTATGG 



********** *********_ ********** 



ACAACGTCAc 

ACAACGTCAc 

ACAACGTCAc 

ACAACGTCAC 

ACAACGTCAc 

ACAACGTCAc 

ACAACGTCAC 

ACAACGTCAc 

ACAACGTCAt 

ACAACGTCAt 

ACAACGTCAc 
*********_ 



1050 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 

ATTCTTAAGA 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}



1051 

aAACAAGGAT 
aAACAAGGAT 
aAACAAGGAT 
aAACAAGGAT 
aAACAAGGAT 



TCAAAGTaAC 
TCAAAGTaAC 
TCAAAGTaAC 
TCAAAGTaAC 
TCAAAGTaAC 



AGAGATAGAc 
AGAGATAGAc 
AGAGATAGAc 
AGAGATAGAc 
AGAGATAGAc 



TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 



1100 
ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 
ATGGTAGAGC 






msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



aAACAAGGAT 
gAACAAGGAT 
aAACAAGGAT 
aAACAAGGAT 
aAACAAGGAT 
aAACAAGGAT 
_********* 



TCAAAGTgAC 

TCAAAGTgAC 

TCAAAGTgAC 

TCAAAGTgAC 

TCAAAGTgAC 

TCAAAGTaAC 
*******_** 



AGAGATAGAt 
AGAGATAGAt 
AGAGATAGAt 
AGAGATAGAc 
AGAGATAGAc 
AGAGATAGAC 



TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 
TTACCAATTG 



*********_ ********** 



ATGGTAGAGC 

ATGGTAGAGC 

ATGGTAGAGC 

ATGGTAGAGC 

ATGGTAGAGC 

ATGGTAGAGC 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1101 

ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 
ATTAATGCGT 



GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 
GATTATTCAA 



CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 
CCTTGGCTAT 



TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGcATGGGA 
TGGtATGGGA 
TGGtATGGGA 
TGGcATGGGA 



1150 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 
GGAGCTTTTT 



********** ********** ********** ***_****** **********



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1151 

CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 
CAACAATTGA 

CAACAATTGA 
********** 



AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 

AAAAGACTTA 
********** 



AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 
AAAAAACATG 



GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 
GTTTTACTAA 



********** ********** 



1200 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 
AGAAGACGTT 

AGAAGACGTT 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1201 

GATCCtATTA 
GATCCtATTA 
GATCCtATTA 
GATCCtATTA 
GATCCtATTA 
GATCCcATTA 
GATCCcATTA 
GATCCcATTA 
GATCCcATTA 
GATCCcATTA 
GATCCtATTA 
*****_****



CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGcAGT 

CTTGGGCAGT 

CTTGGGgAGT 

CTTGGGcAGT 
******_***



TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 

TCATGTTATT 
********** 



TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 
TATCAAAATT 



1250 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 
CAGATAAGGC 



********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1251 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 

TGAACTTAAG 
**********



AAATCTATTa 

AAATCTATTa 

AAATCTATTa 

AAATCTATTa 

AAATCTATTa 

AAATCTATTg 

AAATCTATTg 

AAATCTATTg 

AAATCTATTa 

AAATCTATTa 

AAATCTATTa 
*********_ 



TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 

TGGAAGCCCA 
********** 



AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 
AAAACATATG 



1300 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 
GATGATTATC 



********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1301 

GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 
GTAAGGCAAT 



GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 
GGAGAAGCTT 



CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 
CACAAGCAAT 



********** ********** ********** 



TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 

TTCCTATTTT 
********** 



1350 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 

CTTATCGCCA 
**********



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}



1351 1400 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA 
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA
ACGACCGCAA GTTTAGCCCC TCTAAATACA GATCCATATG TaACAGAGgA 






msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



ACGACCGCAA 
ACGACCGCAA 
ACGACCGCAA 
ACGACCGCAA 
ACGACCGCAA 
ACGACCGCAA 
ACGACCGCAA 



GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 
GTTTAGCCCC 



TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 
TCTAAATACA 



GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 
GATCCATATG 



TaACAGAGgA 
TaACAGAGaA 
TaACAGAGaA 
TtACAGAGaA
TaACAGAGgA 
TaACAGAGgA 
TaACAGAGgA 



********** ********** ********** ********** *_******_* 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1401 

AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 
AGATAAAAGA 



GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 
GCGATTTATA 



ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 
ATATGGAAAA 



CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 
CTTGAGCCAA 



1450 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 
GAAGAAAGAA 



********** ********** ********** ********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1451 

TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 
TTGCTCTCTT 



TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 
TAATCGCCAG 



TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 
TGGGAGCCTA 



TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 
TGTTGCGTAG 



1500 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 
AACACCTTTT 



********** ********** ********** ********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1501 

ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
ACACaAATTG 
ACACcAATTG 
ACACcAATTG 
ACACcAATTG 
ACACaAATTG 
ACACaAATTG 
ACACaAATTG 



CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 
CTAATATGAC 



****_***** **********



AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 

AGGACTCCCA 
********** 



GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 

GCTATCAGTA 
********** 



1550 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 

TCCCGACTTA 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1551 

CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 
CTTATCTGAG 



TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 
TCTGGTTTAC 



CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 
CCATAGGGAC 



GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 
GATGTTAATG 



1600 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 
GCAGGTGCAA 



********** ********** ********** ********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1601 

ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 
ACTATGATAT 



GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 
GGTATTAATT 



AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 
AAATTTGCAA 



********** ********** ********** 



CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 

CTTTCTTTGA 
********** 



1650 

AAAAcATCAT 

AAAAcATCAT 

AAAACATCAT 

AAAAcATCAT 

AAAAcATCAT 

AAAACATCAT 

AAAAcATCAT 

AAAAcATCAT 

AAAAtATCAT 

AAAAtATCAT 

AAAAcATCAT 
****_***** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}



1651 1700 
GGTTTTAATG TTAAATGGCA AAGAATAATA GATAAAGAAG TGAAACCATC 
GGTTTTAATG TTAAATGGCA AAGAATAATA GATAAAGAAG TGAAACCATC
GGTTTTAATG TTAAATGGCA AAGAATAATA GATAAAGAAG TGAAACCATC 






msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}

Consensus 



GGTTTTAATG 
GGTTTTAATG 
GGTTTTAATG 
GGTTTTAATG 
GGTTTTAATG 
GGTTTTAATG 
GGTTTTAATG 
GGTTTTAATG 



TTAAATGGCA 
TTAAATGGCA 
TTAAATGGCA 
TTAAATGGCA 
TTAAATGGCA 
TTAAATGGCA 
TTAAATGGCA 
TTAAATGGCA 



AAGAATAATA 
AAGAATAATA 
AAGAATAATA 
AAGAATAATA 
AAGAATAATA 
AAGAATAATA 
AAGAATAATA 
AAGAATAATA 



GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 
GATAAAGAAG 



TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 
TGAAACCATC 



********** ********** ********** ********** ********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1701 

TaCTGgCCTA 
TaCTGgCCTA 
TaCTGgCCTA 
TaCTGgCCTA 
TaCTGgCCTA 
TgCTGaCCTA 
TgCTGaCCTA 
TgCTGaCCTA 
TaCTGgCCTA 
TaCTGgCCTA 
TaCTGgCCTA 



ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 
ATACAGCCTA 



CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 
CTAACTCCCT 



*_***_**** ********** ********** 



CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 

CTTTAAAGCT 
********** 



1750 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 

CATTCATCAT 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1751 

TAGTAAATTT 

TAGTAAATTT

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 

TAGTAAATTT 
********** 



AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 

AGAAGAAAAT 
********** 



TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 

TCACAAGTTA 
********** 



CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 

CTCAAGTATC 
********** 



1800 

TATCTCTAAA

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 

TATCTCTAAA 
********** 



msa71927.2{173_18RS21}
msa71927.2{173_2603}
msa71927.2{173_A909}
msa71927.2{173_090}
msa71927.2{173_CJB110}
msa71927.2{173_COH1}
msa71927.2{173_M781}
msa71927.2{173_M732}
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT}
Consensus



1801 

AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 
AAATGGATGA 



AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 
AATCGTCTGT 



********** ********** 



TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 

TAAAAATAAA 
********** 



ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 
ccatccgtaa 



1850 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 
tggcatatca 



ccatccgtaa 
ccatccgtaa 



tggcatat--
tggcatatca 



1851 

msa71927.2{173_18RS21} aaaagca
msa71927.2{173_2603} aaaagca
msa71927.2{173_A909} aaaagca
msa71927.2{173_090} aaaagca
msa71927.2{173_CJB110} aaaagca
msa71927.2{173_COH1} aaaagca
msa71927.2{173_M781} aaaagca
msa71927.2{173_M732} aaaagca
msa71927.2{173_H36B}
msa71927.2{173_JM9130013}
msa71927.2{173_1169NT} aaaagca

Consensus 

SEQ ID NO: 4814 

STRAIN 2603 frame: 1 

NSTETSASWPTTin'IVQTNDSNPTAKFVSESGQSVIG^VKPDNSAALTTVDTPHHI SAP 
DALKTTQSSPVVESTSTKLTEETYKQKDGQDLANMWSGQVTSEELVN^ 
LNAVITTRRQEAIEEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGLI YADGKI STFD 
SSyVKKyKDLGPIII^TNFPEYGWRNITDSKLyGLTHNPWDLAHNAGGSSGGSAAAIAS 
GMTP I ASGSDAGGS IRI PSSWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKSSRDAETL 
LTYLKKSDQTLVSVNDLKSLPIAYTLKSPMGTEVSQ 

LPIDGRAIMRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELK 
KS IMEAQKHMDDYRKAMEKLHKQFPI FLSPTTASLAPIiNTDPYVTEEDKRAI YNMENI/SQ 
EERI AliFNRQWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLP IGTMLMAGANYDMVLI 
KFATFFEKHHGFNVKWQRI I DKEVKPSTGL I QPTNSLFKAHSSLVNLEENSQVTQVS I SK 
KWMKSSVKNKPSVMAYQKA 



SEQ ID NO: 4815

STRAIN 090 frame: 1

NSTETSASWPTTNT I VQTNDSNPTAKFVSESGQS V I GQVKPDNSAALTTVDTPHH I SAP 
DALKTTQSSPVVESTSTKLTEETYKQKIX3K^ IAKENPS
LNAVI TTRRQEA I EEARKLKDTNQ P FLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFD 
SS YVKKYKDLGF I ILGQTNFPEYGWRN ITDSKLYGLTHNPWDIAHNAGGS SGGSAAAI AS 
GMTP I ASGSDAGGS I RI PSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETL 
LTYLKKSDQTLVSVNDLKSLP I AYTLKS PMGTEVS QDAKNAI MDNVTFLRKQGFKVTE I D 
LPIDGRAIiMRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELK 
KS IMEAQKHMDDYRKAMEKLHKQFPI FLS PTTASLAPLNTDPYVTEEDKRAI YNMENLSQ 
EERI ALFNRQWEPMLRRTPFTQIANMTGLPAI S I PTYLSESGLPI GTMLMAGANYDMVL I 
KFATFFEKHHGFNVKWQRIIDKEVKPSTGLIQPTNSLFKAHSSLVNIiEENSQVTQVSISK 

KWMKS S VKNKPS VMAYQKA 


SEQ ID NO: 4816 

STRAIN A909 frame: 2 

TTNT I VQTNDSNPTAKFVSESGQS V IGQVKPDNSAALTTVDTPHHI SAPDALKTTQSSPV 
VESTSTKLTEETYKQKDGQDLANMVRSGQVTSEELVNMAYD 1 1 AKENPS LNAVI TTRRQE 
AI EEARKLKDTNQP FLGVPLLVKGLGH S I KGGETNNGL I YADGKI STFDSSYVKKYKDLG 
FIILGGTNFPEYGWRNITDSKLYGLTHNPWDLAHNAGGSSGGSAAAIASGMTPIASGSdA 
GGSIRIPSSWTGLVGLICPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETLLTYLKKSDQTL 
VSVNDLKSLPIAYTLKSPMGTEVSQDAKNAI MDNVTFLRKQGFKVTE IDLPI DGRALMRD 
YSTLAIGMGGAFSTI EKDLKKHGFTKEDVDP ITWAVHVI YQNSDKAELKKS IMEAQKHMD 
DYRKAMEKLHKQFPIFLSPTTASLAPLNTDPYVTEEDKRAIYNMENLSQEERIALFNRQW 
EPMLRRTPFTQIANMTGLPAIS I PTYLSESGLPI GTMLMAGANYDMVL I KFATFFEKHHG 
FNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKKWMKSSVKNKP 
SVMAYQKA 

SEQ ID NO: 4817 

STRAIN COH1 frame: 1

NSTETSAS VAPTTNT I VQTNDSNPTAKFASESGQS VI GQVKPANSAALTTVDTPHISAPD 
ALKTTQSSPVVESPSTKLTEETYKQKDGQDLANMVRSGQVTSEELVNMAYDI IAKENPSL 
NAV I TTRRQEA I EEARKLKDTNQP FLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFDS 
S YVKKYKDLGF I ILGQTNFPEYGWRN I TDSKLYGPTHNPWNLAHNAGGS SGGSAAAI ASG 
MTPI ASGSDAGGS IRI PS SWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKS SRDAETLL 
TYLKKSDQTLVS VNDLKSLP I AYTLKS PMGTEVSQDAKNAI MDNVTFLRKQGFKVTE I DL 
PIDGRALMRDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELKK 
SIVEAQKHMDDYRKAMEKLHKQFPIFLSPTTASLAPIiNTDPYV^ 

ERIALFNRQWEPMLRRTPFTPIANMTGLPAISIPTYLSESGLPIGTMLMAGANYDMVLIK 
FATFFEKHHG FNVKWQRI IDKEVKPSADLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKK 
WMKSSVKNKPSVMAYQKA 

SEQ ID NO: 4818 

STRAIN M732 frame: 1 

S VAPTTNT I VQTNDSNPTAKFASE SGQS VI GQVKPANSAALTTVDTPH I S APDALKTTQS 
S PVVES PSTKLTEET YKQKDGQDLANMVRSGQVTS EELVNMAYD 1 1 AKENPSLNAVITTR 
RQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGLI YADGKI STFDSS YVKKYK 
DLGF 1 1 LG<2TNFPEYGWRNITDSKLYGXTHNPWDLAHNAGGSSGGSAAAI ASGMTPI ASG 
SDAGGS IRI PS SWTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKS SRDAETLLTYLKKSD 
QTLVS VNDLKSLP I AYTLKS PMGTEVSQDAKNAI MDNVTFLRKQGFKVTE IDLPI DGRAL 
MRDYSTLAIGMGGAFST IEKDLKKHGFTKEDVDPI TWAVHVI YQNSDKAELKKS I VEAQK 
HMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDPYVTEKDKRAI YNMENLSQEERI ALFN 
RQWEPMLRRTPFTPI ANMTGLPAI S I PTYLSESGL PI GTMLMAGANYDMVL I KFATFFEK 
HHGFNVKWQRI I DKEVKPSADLI QPTNSLFKAHS SLVNLEENSQVTQVS I SKKWMKSSVK 
NKPSVMAYQKA 

SEQ ID NO: 4819 

STRAIN 18RS21 frame: 1 

NSTETSAS WPTTNT I VQTNDSNPTAKFVSESGQS VI GQVKPDNSAALTTVDTPHHI SAP 
DALKTTQSSPVVESTSTKLTEETYKQKDGQDLANMVRSGQVTSEELVNMAYDI IAKENPS 
LNAVI TTRRQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFD 
SS YVKKYKDLGF 1 1 LGQTNFPE YGWRN I TDS KLYGLTHNPWDLAHNAGGS SGGSAAAI AS 
GMTPIASGSDAGGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETL 
LTYLKKSDQTLVSVNDLKSLPIAYTLKSPMGTEVSQDAKNAI MDNVTFLRKQGFKVTEID 
LP IDGRALMRD YSTLAI GMGGAFST I EKDLKKHGFTKEDVDP ITWAVHVI YQNSDKAELK 
KS IMEAQKHMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQ 
EERIALFNRQWEPMLRRTPFTQIANMTGLPAISIPTYLSESGLPIGTMLMAGANYDMVLI 
KFATFFEKHHGFNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLVNLEENSQVTQVS I S K 
KWMKS S VKNKPS VMAYQKA 

SEQ ID NO: 4820 

STRAIN M781 frame: 2 

ASVAPTTNTIVQTNDSNPTAKFASESGQSVIGQVKPANSAALTTVDTPHISAPDALKTTQ 
SSPVVESPSTKLTEETYKQKDGQDIuANMVRSGQVTSEELVNMAYDIIAKENPSLNAV 
RRQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YADGKI STFDS SYVKKY 
KDLGF 1 1 LGQ/mFPEYGWRNITDSKLYGPTHNPWNLAHNAGGSSGGSAAA I ASGMTP IAS 
GSDAGGS IRI PS S WTGLVGLKPTRGLVSNEKPDS YSTAVHFPLTKS SRDAETLLTYLKKS 
DQTLVSVNDLKSLPIAYTLKSPMGTEVSQDAKNAIMDNVTFLREGGFKVTEIDLPIDGRA 
LMRDYSTLA I GMGGAFST I E KDLKKHGFTKED VD P I TWAVHVI YQNSDKAELKKS I VEAQ 
KHMDD YRKAMEKLHKQFP I FLS PTTAS LAPLNTDP YVTE KDKRAI YNMENLSQEERI ALF 
NRQWEPMLRRTPFTP IANMTGLPAI S I PTYLSESGLPI GTMLMAGANYDMVL I KFATFFE 
KHHGFNVKWQRI I DKEVKPSADLI QPTNSLFKAHS SLVNLEENSQVTQVS I SKKWMKSSV 
KNKP SVMAYQKA 

SEQ ID NO: 4821 

STRAIN CJB110 frame: 3 

VPTTNTIVQTNDSNPTAKFVSESGQSVIGQVKPDNSAALTTVDTPHHISAPDALKTTQSS

PVVESTSTIKLTEETYKQKDGKDI^ANMWSGQVTSEELVNMAYDI IAKENPSLNAVITTRR 
QEAI EEARKLKDTNQPFIiGVPLLVKGLGHS I KGGETNNGL I YADGKI STPDS SYVKKYKD 
LGFI I IX3QTNFPEYGWRNITDSKLYGLTHNPWDLAHNAGGSSGGSAAAI ASGMTP I ASGS 
DAGGS IRI PSSWTGLVGLKPTRGLVSHEKPDS YSTAVHFPLTKS SRDAETLLTYLKKSDQ 
TLVSVNDLKSLP I AYTLKS PMGTE VSQDAKNAI MDNVTFLRKQGFKVTE I DLPIDGRALM 
RDYSTLAIGMGGAFSTIEKDLKKHGFTKEDVDPITWAVHVIYQNSDKAELKKSIMFJVQKH 
MDDYRKAMEKLHKQFPIFLSPTTASLAPLNTDPYVTEEDKRAIYNMENLSQEERIALFNR 
QWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLPIGTMLMAGANYDMVLIKFATFFEKH 
HGFNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLVNLEENSQVTQVS I SKKWMKSSVKN 
KPSVMAYQKA 

SEQ ID NO: 4822 

STRAIN 1169NT frame: 1 

NSTETSASVAPTTNT I VQTNDSNPTAKFASESGQSVI CQVKPDNSAALTTVDTPH I SAPD 
DLKTTQSSPVVESTSTKLTEETYKQKDGQDIiANMVRSGQVTSEELVNMAYDIIAKENPSL 
NAVITTRRQEAIEEARKLKDTNQPFIiGVPLLVKGLGHSIKGGETNNGLIYADGKISTFDS 
S YVKKYKDLGF 1 1 LGQTNF PE YGWRNI TDSKL YG PTHNPRNLAHNAGGS SGGS AAAI ASG 
MTPIASGSDAGGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETLL 
TYLKKSDGTLVSVNDLKSLPIAYTLKSPMGTEVSQDAKNAIMDNVTFLRKQGFKVTEIDL 

PIDGRALMRDYSTLAIGMGGAFST I EKDLKKHGFTKEDVDPI TWAVHVI YQNSDKAELKK 
SIMElAQKHMDDYRKAMEKIiHKQFPIFLSPTTASLAPLNTDPYVTEEDKRAIYNMENLSQE 
ERIALFNRQWEPMLRRTPFTQIANMTGLPAI S IPTYLSESGLPIGTMLMAGANYDMVLIK 
FATFFEKHHGFNVKWQRI IDKEVKPSTGLI QPTNSLFKAHSSLWLEENSQVTQVS I SKK 

WMKS S VKNKPS VMAYQKA 

SEQ ID NO: 4823 

STRAIN JM9130013 frame: 2 

SVAPTTNTIVQTNDSNPTAKFSSESGQSVIGQVKPANSVALTTVDTPHISAPDALKTTQS 
SPVVESPSTKLTEETYKQKDGQELANMWSGQVTSEELVNMAYDIIAKENPSLNAVITTR 
RQEAI EEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YAGGKI STFDSSYVKKYK 
DLGFI I LGQTNF PE YGWRNI TDSKLYGPTHNPWNLAHNAGGSSGGSAAVI ASGMTPI ASG 
SDAGGSIRIPSSWTGLVGLKPTRGLVSNEKPDSYSTAVHFPLTKSSRDAETLLTYLKKSD 
QTLVSVNDLKSLP I AYTLKS PMGTE VSQDAKNAI MDNVI FLRKQGFKVTEIDLP I DGRAL 
MRDYSTLAI GMGGAFST I EKDLKKHGFTKEDVDP I TWGVHVI YQNSDKAELKKS I MEAQK 
HMDDYRKAMEKLHKQFP I FLSPTTASLAPLNTDP YVTEEDKRAI YNMENLSQEERIALFN 
RQWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLPIGTMLMAGANYDMVLI KFATFFEK 
YHGFNVKWQR I IDKEVKPSTGLIQPTNSLFKAHSSLVNLEENSQVTQVSI SKKWMKSSVK 
NKPSVMAY 

SEQ ID NO: 4824 

STRAIN H36B frame: 3 

S VVPTTNTI VQTNDSNPTAKFS SE SGQS VI GQVKPANS VALTTVDTPHI SAPDALKTTQS 
S PVVESPSTKLTEET YKQKDGQDLANMVRSGQVTS EELVNMAYD 1 1 AKENP SLNAVI TTR 
RQEAIEEARKLKDTNQPFLGVPLLVKGLGHS I KGGETNNGL I YAGGKI STFDSSYVKKYK 
DLGFI I LGQTNFPEYGWRNI TDSKLYGPTHNPWNLAHNAGGSSGGSAAVI ASGMTP I ASG 
SDAGGS I RI PS S WTGLVGLKPTRGLVSNEKPD S YSTAVHFPLTKS SRDAETLLTYLKKSD 
QTLVSVNDLKSLP I AYTLKSPMGTEVSQDAKNAI MDNVI FLRKQGFKVTE IDLPI DGRAL 
MRDYSTLAI GMGGAFST I EKDLKKHGFTKEDVDP I TWAVHVI YQNSDKAELKKS IMEAQK 
HMDDYRKAMEKLHKQFPI FLSPTTASLAPLNTDPYVTEEDKRAI YNMENLSQEERIALFN 
RQWEPMLRRTPFTQI ANMTGLPAI S I PTYLSESGLP IGTMLMAGANYDMVLI KFATFFEK 
YHGFNVKWQRI I DKEVKPSTGLIQPTNSLFKAHSSLVNLEENSQVTQVS I SKKWMKSSVK 

NK 



PRETTY of: /biotmp/msa72034.2{*} January 22, 2003 07:25

                           1                                                   50
msa72034.2{173_090}        nstetsasvv pTTNTIVQTN DSNPTAKFvS ESGQSVIgQV KPdNSaALTT
msa72034.2{173_18RS21}     nstetsasvv pTTNTIVQTN DSNPTAKFvS ESGQSVIgQV KPdNSaALTT
msa72034.2{173_2603}       nstetsasvv pTTNTIVQTN DSNPTAKFvS ESGQSVIgQV KPdNSaALTT
msa72034.2{173_A909}                   TTNTIVQTN DSNPTAKFvS ESGQSVIgQV KPdNSaALTT
msa72034.2{173_CJB110}              v pTTNTIVQTN DSNPTAKFvS ESGQSVIgQV KPdNSaALTT
msa72034.2{173_COH1}       nstetsasva pTTNTIVQTN DSNPTAKFaS ESGQSVIgQV KPaNSaALTT
msa72034.2{173_M732}              sva pTTNTIVQTN DSNPTAKFaS ESGQSVIgQV KPaNSaALTT
msa72034.2{173_M781}             asva pTTNTIVQTN DSNPTAKFaS ESGQSVIgQV KPaNSaALTT
msa72034.2{173_1169NT}     nstetsasva pTTNTIVQTN DSNPTAKFaS ESGQSVIcQV KPdNSaALTT
msa72034.2{173_H36B}              svv pTTNTIVQTN DSNPTAKFsS ESGQSVIgQV KPaNSvALTT
msa72034.2{173_JM9130013}         sva pTTNTIVQTN DSNPTAKFsS ESGQSVIgQV KPaNSvALTT
Consensus                             _********* ********_* *******_** **_**_****

                           51                                                 100
msa72034.2{173_090}        VDTphHISAP DaLKTTQSSP VVEStSTKLT EETYKQKDGk dLANMVRSGQ
msa72034.2{173_18RS21}     VDTphHISAP DaLKTTQSSP VVEStSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_2603}       VDTphHISAP DaLKTTQSSP VVEStSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_A909}       VDTphHISAP DaLKTTQSSP VVEStSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_CJB110}     VDTphHISAP DaLKTTQSSP VVEStSTKLT EETYKQKDGk dLANMVRSGQ
msa72034.2{173_COH1}       VDT.pHISAP DaLKTTQSSP VVESpSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_M732}       VDT.pHISAP DaLKTTQSSP VVESpSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_M781}       VDT.pHISAP DaLKTTQSSP VVESpSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_1169NT}     VDT.pHISAP DdLKTTQSSP VVEStSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_H36B}       VDT.pHISAP DaLKTTQSSP VVESpSTKLT EETYKQKDGq dLANMVRSGQ
msa72034.2{173_JM9130013}  VDT.pHISAP DaLKTTQSSP VVESpSTKLT EETYKQKDGq eLANMVRSGQ
Consensus                  ***__***** *_******** ****_***** *********_ _*********



                           101                                                150
msa72034.2{173_090}        VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_18RS21}     VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_2603}       VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_A909}       VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_CJB110}     VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_COH1}       VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_M732}       VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_M781}       VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_1169NT}     VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_H36B}       VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
msa72034.2{173_JM9130013}  VTSEELVNMA YDIIAKENPS LNAVITTRRQ EAIEEARKLK DTNQPFLGVP
Consensus                  ********** ********** ********** ********** **********

                           151                                                200
msa72034.2{173_090}        LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_18RS21}     LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_2603}       LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_A909}       LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_CJB110}     LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_COH1}       LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_M732}       LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_M781}       LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_1169NT}     LLVKGLGHSI KGGETNNGLI YAdGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_H36B}       LLVKGLGHSI KGGETNNGLI YAgGKISTFD SSYVKKYKDL GFIILGQTNF
msa72034.2{173_JM9130013}  LLVKGLGHSI KGGETNNGLI YAgGKISTFD SSYVKKYKDL GFIILGQTNF
Consensus                  ********** ********** **_******* ********** **********

                           201                                                250
msa72034.2{173_090}        PEYGWRNITD SKLYGlTHNP wdLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_18RS21}     PEYGWRNITD SKLYGlTHNP wdLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_2603}       PEYGWRNITD SKLYGlTHNP wdLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_A909}       PEYGWRNITD SKLYGlTHNP wdLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_CJB110}     PEYGWRNITD SKLYGlTHNP wdLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_COH1}       PEYGWRNITD SKLYGpTHNP wnLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_M732}       PEYGWRNITD SKLYGxTHNP wdLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_M781}       PEYGWRNITD SKLYGpTHNP wnLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_1169NT}     PEYGWRNITD SKLYGpTHNP rnLAHNAGGS SGGSAAaIAS GMTPIASGSD
msa72034.2{173_H36B}       PEYGWRNITD SKLYGpTHNP wnLAHNAGGS SGGSAAvIAS GMTPIASGSD
msa72034.2{173_JM9130013}  PEYGWRNITD SKLYGpTHNP wnLAHNAGGS SGGSAAvIAS GMTPIASGSD
Consensus                  ********** *****_**** __******** ******_*** **********

                           251                                                300
msa72034.2{173_090}        AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_18RS21}     AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_2603}       AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_A909}       AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_CJB110}     AGGSIRIPSS WTGLVGLKPT RGLVShEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_COH1}       AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_M732}       AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_M781}       AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_1169NT}     AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_H36B}       AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
msa72034.2{173_JM9130013}  AGGSIRIPSS WTGLVGLKPT RGLVSnEKPD SYSTAVHFPL TKSSRDAETL
Consensus                  ********** ********** *****_**** ********** **********

                           301                                                350
msa72034.2{173_090}        LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_18RS21}     LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_2603}       LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_A909}       LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_CJB110}     LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_COH1}       LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_M732}       LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_M781}       LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_1169NT}     LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNVtFLR
msa72034.2{173_H36B}       LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNViFLR
msa72034.2{173_JM9130013}  LTYLKKSDQT LVSVNDLKSL PIAYTLKSPM GTEVSQDAKN AIMDNViFLR
Consensus                  ********** ********** ********** ********** ******_***

                           351                                                400
msa72034.2{173_090}        kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_18RS21}     kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_2603}       kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_A909}       kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_CJB110}     kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_COH1}       kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_M732}       kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_M781}       eQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_1169NT}     kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_H36B}       kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
msa72034.2{173_JM9130013}  kQGFKVTEID LPIDGRALMR DYSTLAIGMG GAFSTIEKDL KKHGFTKEDV
Consensus                  _********* ********** ********** ********** **********

Table 48: Comparative Sequences relating to SAG1474



                           401                                                450
msa72034.2{173_090}        DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_18RS21}     DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_2603}       DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_A909}       DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_CJB110}     DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_COH1}       DPITWaVHVI YQNSDKAELK KSIvEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_M732}       DPITWaVHVI YQNSDKAELK KSIvEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_M781}       DPITWaVHVI YQNSDKAELK KSIvEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_1169NT}     DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_H36B}       DPITWaVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
msa72034.2{173_JM9130013}  DPITWgVHVI YQNSDKAELK KSImEAQKHM DDYRKAMEKL HKQFPIFLSP
Consensus                  *****_**** ********** ***_****** ********** **********



                           451                                                500
msa72034.2{173_090}        TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_18RS21}     TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_2603}       TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_A909}       TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_CJB110}     TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_COH1}       TTASLAPLNT DPYVTEkDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_M732}       TTASLAPLNT DPYVTEkDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_M781}       TTASLAPLNT DPYVTEkDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_1169NT}     TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_H36B}       TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
msa72034.2{173_JM9130013}  TTASLAPLNT DPYVTEeDKR AIYNMENLSQ EERIALFNRQ WEPMLRRTPF
Consensus                  ********** ******_*** ********** ********** **********



                           501                                                550
msa72034.2{173_090}        TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_18RS21}     TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_2603}       TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_A909}       TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_CJB110}     TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_COH1}       TpIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_M732}       TpIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_M781}       TpIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_1169NT}     TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKhH
msa72034.2{173_H36B}       TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKyH
msa72034.2{173_JM9130013}  TqIANMTGLP AISIPTYLSE SGLPIGTMLM AGANYDMVLI KFATFFEKyH
Consensus                  *_******** ********** ********** ********** ********_*



                           551                                                600
msa72034.2{173_090}        GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_18RS21}     GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_2603}       GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_A909}       GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_CJB110}     GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_COH1}       GFNVKWQRII DKEVKPSadL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_M732}       GFNVKWQRII DKEVKPSadL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_M781}       GFNVKWQRII DKEVKPSadL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_1169NT}     GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_H36B}       GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
msa72034.2{173_JM9130013}  GFNVKWQRII DKEVKPStgL IQPTNSLFKA HSSLVNLEEN SQVTQVSISK
Consensus                  ********** *******__* ********** ********** **********



                           601             619
msa72034.2{173_090}        KWMKSSVKNK psvmayqka
msa72034.2{173_18RS21}     KWMKSSVKNK psvmayqka
msa72034.2{173_2603}       KWMKSSVKNK psvmayqka
msa72034.2{173_A909}       KWMKSSVKNK psvmayqka
msa72034.2{173_CJB110}     KWMKSSVKNK psvmayqka
msa72034.2{173_COH1}       KWMKSSVKNK psvmayqka
msa72034.2{173_M732}       KWMKSSVKNK psvmayqka
msa72034.2{173_M781}       KWMKSSVKNK psvmayqka
msa72034.2{173_1169NT}     KWMKSSVKNK psvmayqka
msa72034.2{173_H36B}       KWMKSSVKNK
msa72034.2{173_JM9130013}  KWMKSSVKNK psvmay
Consensus                  **********
Table 49: Comparative Sequences related to SAG1502



SEQ ID NO: 4901
STRAIN 2603 

aaacatccgatacttaatgatcaaaaatccttagcaattgttgaacagat 
agaatatgattttgataaattcgataattcagaagcttctttttatgcaa 
cattagctagawttcgcgttatggatagagaaatcaaaaaatttattaga 
gaaaatccaaatagtcaaatcctttcaattggttgtggacttgatacaag 
gtttgaaagagtcgataatggacaaattaggtggtataaccttgatttgc 
cagagg 1 1 a t ggaga taagaaaattatttttt gaagagca tgaa agag 1 1 
actaatatagcaaaatcagccctagatgaaacttggacacgggaggtaaa 
tccccaaaatgccccttttctaatcgtgtcagaaggtgttttaatgtttc 
taaaagaagatgacgtagagacttttcttcatatcctgacaaattcattt 
agccaatttatggcacaatttgatttgtgtcataaggaaatgattaataa 
aggaaagcaacatgatacagtaaagtatatggatacagaatttcagtttg 
gtatcacagatggtcatgagattgtggatttagaccctaaattaaagcaa 
ataaatctgattaactttacagatgagatgagcaaatttgagttaggcac 
acttcgctctttacttccaacaattcgtaaatttaataattgtttaggtg 
tgtacgaatataaagcatc 

SEQ ID NO: 4902
STRAIN 090 

TAATGATCAAAAAT C CTTAG CAATTGTTG AACAGATAGAATATGATTTTG 

ATAAATTCGATAATTC^GA^GCTTC"lU'l"l'lATGGAACATTAGCrrAGAATT 

CGCGTTATGGATAGAGAAATCAAAAAATTTATTAGAGAAAATCCMATAG 

TCAAATCCTTTCAATTGGTTGTGGACriTGATACAAGGTTTGAAAGAGTCG 

ATAATGGACAAATTAGGTGGTATAACCTTGATTTGCCAGAgGTTATGGAG 

ATAAGAAAATTATTTTTTGAAGAG CATGAAAG AGTTACTAATAT AG CAAA 

AT CAG CCATAGATGAAACTTGGACACGGG AGGTAAAT CC C CAAAATGCCC 

CTTTT CT AAT CGTGTCAGAAGGTGTTTTAATGTTT CTAAAAGAAGATGAC 

GTAG AGACTTTTCTTCATAT CCTGACAAATT CATTTAGC CAATTTATGGC 

ACAATTTGATTTGTGT CATAAGGAAATGATTAATAAAGG AAAG CAACATG 

ATACAGTAAAGTATATGGATACAGAATTTCAGTTTGGTATCACAGATGGT 

CATGAGATTGTGGATTTAGACCCTAAATTAAAGCAAATAAATCTGATTAA 

CTTTACAGATGAGATGAGCAAATTTGAGTTAGGCACA 

TTCCAACAATTCGTAAATTTAATAATTGTTTAGGTGTGTACGAATATAAA 

GCATC 

SEQ ID NO: 4903 
STRAIN A909 

AAACATCCGATACTTAATGA 

TCAAAAATCCTTAG CAATTGTTGAACAGATAGAATATGATTTTGATAAAT 
TCGATAATTCAG AAG CTT CTTTTTATG CAACATTAG CTAGAATT CGCGTT 
ATGGATAGAGAAAT CAAAAAATTTATTAGAGAAAAT C CAAATAGT CAAAT 
CcTTT CaATTGGTTGTGGACITGATACAAGGTTTGAAAGAGT CGATAATG 
GACAAATTAGGTGGTATAAC CTTG ATTTG C CAGAGGTTATGG AG ATAAGA 
AAATTaTTTTTTGAAGAG CATG AAAGAGTTACTAATATAG CAAAATCAG C 
CCTAGATGaAACITGGACACGGGAGGTAAATCCCCAAAATGCCCCTTTTC 
TAATCGTGT CAG AAGGTGTTTTAATGTT t CTAAAAGAAGATGACGTAGAG 
ACTTTT CTTGATATCCTGAGAAATT CATTTAGCCAATTTATGGCACAATT 
TGATTTGTGT CATAAGGAAATGATTAATAAAGGAAAG CAACATGATACAG 
TAAAGTATATGGATACAGAATTTCAGTTTGGTATCACAGATGGTCATGAG 
ATTGTGGATTTAGAC CCTAAATTAAAG CAAAT AAATCTGATTAACTTTAC 
AGATGAGATGAG CAAATTTG AGTTAGG CACACTTCGCT C1TTACTTC CAA 
CAATT CGTAAATTTAATAATTGTTTAGGTGTGTACGAATATAAAG CATC 

SEQ ID NO: 4904 
STRAIN H36B 

AAACATCCX3ATACTTAATGATCAAAAATCCTTAGCA 

ATTGTTGAAGA.GATAGAAT ATGATTTTGAT AAATTCGATAATT CAGAAG C 
TTCTTTTTATGCAaCA.TTAGCTAGAATTCGCGTTATGGATAGAGAAAT CA 
AAAAATTTATTAGAGAAAATCCAAATAGTCATATCCTTT CAATT GGCTGT 
GgACTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTA 
TAACCITGATTTGCCAGAGGTTATGGAGATAAGAAAATTATTTT 
AGCATGAAAGAGTTACTAATATAG CAAAATCAGCCcTAGATGAAACTTGG
ACACGGGAGGTAAATCCCCAAAATGCCCCTITTTCrrAATCGTGTCAGAAGG 
TGTTTTAATGTTTCTAAAAGAAGATGACGTAGAGACriTTCITCATATCC 
TGACAAATT CATTTAGC CA^TTTATGG CACAATTTGATTTGTGT CAg AAG 
GAAATGATTAATAAAGGAAAGCAACATGATACAGTAAAGTATATGGATAC 
AGAATTTCAGTTGGGTATCACAGATGGTCATGAAATTGTGGATTTAGACC 
CTAAATTAAAGCAAATAAATCTGATTAACITTACAGATGAGATC 
TTTGAGTTAGGCACACTTCG CTCTTTACTT CCAACAAT^ 
TAATTGTTTAGGTGTGTACGAATATAAAGCATC 

SEQ ID NO: 4905 
STRAIN 18RS21 

AACATCCGATACTTAATGATCAAAAATCCTTAGCAAT 
TGTTGAACAGAT AGAATATG ATTTTGATAAATT CGAT AATTCAGAAG CTT 
CTITTTTATG CAACATTAG CTAGAATTCGCGTTATGGATAGAGAAATCAAA 
AAATTTATTAGAGAAAAT CCAAATAGT CaAAT CCTTT CAATTGGTTGTGG 
ACTTGATACAAGGTTTGAA^GAGTCGATAATGGACAAATTAGGTGGTATA 
AC CTTGATTTG C CAG AG GTTATGGAGAT AAGAAAATT ATTTTTTG AAGAG 
CATGAAAGAGTTACT AATATAG CAAAATCAGC CCTAG ATG AAACTTGGAC 
ACGGGAGGTAAATCCCCAAAATGCCCCTTTTCTAATCGTGTCAgAAGGTG 
TTTTAATGTTTCTAAAAGAAGATGACGTAGAG ACTriT CTT CATATC CTG 
ACAAATTCATTTAGCCAATTTATGGCACaATTTGATTTGTGTCATAaGGA
AATG ATT AAT AAAG GAAAG CAACATGATACAGTAAAGTATATGG ATACAG 
AATTT CAGTTTGGT AT CACAGATGGT CAT G AGATTGTGG ATTT AGAC C C T 
AAATTAAAGCAAATAAATCTGATTAACTTTACAGATGAGATGAGCAAATT 
TG AGTTAGG CACACTT CG CTCTTTACTTCCAACAATTCGTAAA.T TTAAT A 
ATTGTTTAGGTGTGT ACGAA t ATA a aG CATC 

SEQ ID NO: 4906 
STRAIN M732 

AAACATCCGATACTTAATGATCAAAAATCCTTAG CAATTGTTGAACA 

GATAG AATAT GATTTGGAT AAATT CGATAATT CAGAAG CTT CTTTTTAT G 

CAACATTAGCTAGAATTCGCGTTATGGATAGAGAAATCAAAAAATTTATT 

AGAGAAAAT C CAAAT AGT CAAATCCTTTCAATTGGTTGT GGACTT GATAC 

AAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTATAACCTTGATT 

TGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTGAAGAGCATGAAAGA 

GTTACTAAT ATAG CAAAAT CAG CC CTAGATGAAACTTGGACACGGGAGGT 

AAATCCCCAAAATGCCCCTTTTCTAATCGTGTCAGAAGGTGTTTTAATGT 

TTCTAAAAgAAGATGACGTAGAGACTTTTCTTCAtATCCTGACAAATT^ 

TTTAG CCAATTTATGGC a CAATTTG ATTTGTGTCATAAGG AAATG ATTAA 

TAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACAGAATTTCAGT 

TTGGTATCACAGATGGTCATGAGATTGTGGATTTAGACCCTAAATTAAAG 

CAAAT AAAT CTGATTAACTTTACAGATGAGATGAG CAAATTTGAGTTAgG 

CACACTT CGCTTCTTTACTTCCAACAATTCGTAi^ 

G t GTGTACGAAT ATAAAG CATC 

SEQ ID NO: 4907
STRAIN COH1 

AAACATC CX3ATACTTAATGATCAAAAATCCTTAG CAA 
TTGTTGAACAGATAGAATATGATTTGGATAAATT CGATAATT CAGAAG CT 
TLTTTT'l ATG CAACATTAGCTAGAATT CGCGTTATGGAT AGAGAAAT CAA 
AAAATTTATT AGAGAAAATCCAAATAGT CAAAT C CTTTCAATTGGTTGTG 
GACTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTAT 
AACCI^GATTTGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTGAAGA 
GCATGAAAGAGTTACTAATATAGCAAAAT CAG CCCTAGATGAAACTTGG A 
CACGGGAGGTAAAT CCC CAAAATG C CCCTTTTCTAATCGTGT CAG AAGGT 
GTTTTAATGTTT CTAAAAGAAGATGACGT AGAGACITTTCTT CATAT CCT 
GACAAATT CATTTAGCCAATTT ATGGCACAATTTGATTTGTGT CATAAGG 
AAATGATTAATAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACA 
GAATTTCAGTTTGGTAT CACAGATGGT CATGAGATTGTGGATTTAGAC CC 
TAAATTAAAGCAAATAAATCTGATTAACTTTACAGATGAGATGAGCAAAT 
TTGAGTTAGGCACACTTCGCTCTTTACTTCCAACAATTC 
AATTGTTTAGGTGTGTACG AATATAAAGCAT C 

SEQ ID NO: 4908 
STRAIN M781

AAACATCCGATACTTAATGATCA 

AAAATCCTTAGCAATTGTTGAACAGATAGAATATGATTTGGATAA^TTCG 
ATAATTCAGAAG CTT CTTTTTATG CAACATTAGCT AG AATTCGCGTTATG 
GATAGAGAAATCAAAAAATTTATTAGAGAAAAT CCAAATAGT CAAAT CCT 
TTCAATTGGTTGTGGACTTGATACAAGGTTTGAAAGAG 
AAATTAGGTGGTATAACCTTGATTTGCCAGAGGTTATGGAGATAAGAAAA 
TTATTTTTTGAAGAG CATG AAAGAGTTACTAATATAGCAAAATCAGC C CT 
AGATGAAACTTGGAGACGGGAGGTAAATC C CCAAAATGC C CCTTTT CTAA 
TCGTGTCAGAAGGTGTTTTAATGTTTCTAAAAgAAGATGACGTAGAGACT 
TTT CTTCATATCCTGACAAAT t CATTTAGCCAATTTAtGGCACAATTTGA 
TTTGTGT CATAAGGAAATGATTAATAAAGG AAAG CAACATGATACAGTAA 
AGTATATGGATACAGAATTT CAGTTTGGTAT CACAGATGGT CATGAGATT 
GTGGATTTAg AC CCTAAATTAAAG CAAATAAAT CTGATTAACTTTACAG A 
TGAGATGAGCAAATTTGAGTTAGG GACACTTCGCT CTTTACTT C CA^CAA 
TTCGTAAATTTAATAAT tGTTTAGGTGTGTACGAATATAAAG CATC 

SEQ ID NO: 4909 
STRAIN CJB110 

AAACATCCGATACTTAATGATCAAAA^TCCTTAGCAA 
TTGTTGAACA.GATAGAATATGATTTTGATAAATT CGATAATT CAGAAGCT 
TCTTTTTATG CAACATT AGCTAGAATT CG CGTTATGGATAGAGAAAT CAA 
AAAATTTATT AGAGAAAATC CAAATAGTCAAATCCTTTCAATTG 
GAOTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTAT 
AACCTTGATTTG CCAGAGGTTATGG AGATAAGAAAATT^ 
GCATGAAAGAGTTACTAATATAGCAAAAT CAG CCATAGATGAAACTTGG A 
CACGGGAGGTAAAT CCC CAAAATGC CC CTTTTCTAAT CGTGT CAGAAGGT 
GTTTTAATGTTT CTAAAAGAAGATG ACGTAGAGACITTTCTT CAT AT CCT 
GACAAATTCATTT AG CCAATTTATGG CACAATTT GATTTGTGT CATAAG G
AAATGATTAATAAAGGAAAG CAACATGATACAGTAAAGT ATATGG ATACA
GAATTT CAGTTTGGT ATCACAGATGGTCATGAGATTGTGGATTTAGACCC 
TAAATTAAAG CAAATAAAT CTG ATT AACTTTACAG ATGAG AT GAG CAAAT 
TTGAGTTAGG CACACITCG CTCTTTACTTC CAACAA 
AATTGTTTAGGTGTGTACGAAT ATAAAG CATC 

SEQ ID NO: 4910 
STRAIN 1169NT

AAACATCCGATACTTAATGATCAAAAATCCTTAGCAAT 
TGTTGAACAGATAGAATATGATTTTGATAAATTCGATAATTCAGAAGCTT 






WO 2004/018646 



PCT/US2003/026827 






CTTTTTATGCAACATTAGCTAGAATTCGCGTTATGGATAGAGAAATCAAA 
AAATTTATTAGAGAAAATCCAAATAGTCATATCCTTTCTATTGGTTGTGG 
ACTTGATACAAGGTTTGAAAGAGTCGATAATGGACAAATTAGGTGGTATA 
ACCTTGATTTGCCAGAGGTTATGGAGATAAGAAAATTATTTTTTGAAGAG 
CATGAAAGAGTTACTAATATAGCAAAATCAGCCCTAGATGAAACTTGGAC 
ACAGGAGGTAAATCCCCAAAATGCCCCTTTTCTGATCGTGTCAGAAGGTG 
TTTTAATGTTTCTAAAAGAAGATGACGTAGAGACTTTTcTTCATATCCTG 
ACAAATTCATTTAGCGAATTTATGGCACAATTTGATTTGTGt CAGAAGGA 
AATGATTAATAAAGGAAAGCAACATGATACAGTAAAGTATATGGATACAG 
AATTTCAGTTTGGTATCACAGATGGTCATGAAATTGTGGATTTAGACCCT 
AAATTAAAGCAAATAAATCTGATTAACITTACAGATGAGATGAGCAAATT 
TGAGTTAGGCACACTTCGCTCTTTACTTCCAACAATTCGTAAATTTAATA 
ATTGTTTAGGTGTGTACGAATATAAAGCATC 

SEQ ID NO: 4911 
STRAIN JM9130013

AGCAATTGTTGAACAGATAGAATATGATT 

TTGATAAATTCGATAATTCAGAAGCTTCTTTTTATGCAACATTAGCTAGA 
ATTCGCGTTATGGATAGAGAAATCAAAAAATTTATTAGAGAAAATCCAAA 
TAGTCA.TATCCTTTCAATTGGCTGTGGACTTGATACAAGGTTTGAAAGAG 
T CGATAATGGACAAATTAGGTGGT ATAAC CTTGATTTG CCAGAGGTTATG 
GAGATAAGAAAATTATTTTTTGAAG AG CATGAAAG AGTTACT AATATAG C 
AAAATCAGCCCTAGATGAAACTTGGACACGGGAGGTAAATCCCCAAAATG 
CCCCTTTTCTAATCGTGTCAGAAGGTGTTTTAATGTTTCTAAAAGAAGAT 
GACGTAGAG ACTTTT CTTCATATCCTGACAAATT CATTT AGC CAATTTAT 
GG CACAATTTGATTTGTGTCAgAAGGAAATGATTAATAAAGGAAAGCAAC 
ATGATACA.GTAAAGTATATGGATACAGAATTTCAGTTTGGTATCACAGAT 
GGTCATG AAATTGTGGATTT AG AC CCTAAATTAAAG CAAATAAATCTGAT 
TAACTTTACAGATGAGATGAGCAAATTTGAGTTAGGC^GA.CTTCGCTCTT 
TACTTCCAACAATTCGTAAATTTAATAATTGTTTAGGTGTGTACGAATAT 
AAAGCATC 



PRETTY of: /biotmp/msa42193.2{*} January 21, 2003 05:04

                           1                                                   50
msa42193.2{176_090}                       taatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_CJB110}     AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_18RS21}     -AACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_2603}       AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_A909}       AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_COH1}       AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_M732}       AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_M781}       AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_H36B}       AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
msa42193.2{176_JM9130013}                                   --AGCAATTG TTGAACAGAT
msa42193.2{176_1169NT}     AAACATCCGA TACTtaatga tcaaaaatcc ttAGCAATTG TTGAACAGAT
Consensus                  ********** ****                    ******** **********



                           51                                                 100
msa42193.2{176_090}        AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_CJB110}     AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_18RS21}     AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_2603}       AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_A909}       AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_COH1}       AGAATATGAT TTgGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_M732}       AGAATATGAT TTgGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_M781}       AGAATATGAT TTgGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_H36B}       AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_JM9130013}  AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
msa42193.2{176_1169NT}     AGAATATGAT TTtGATAAAT TCGATAATTC AGAAGCTTCT TTTTATGCAA
Consensus                  ********** **_******* ********** ********** **********



                           101                                                150
msa42193.2{176_090}        CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_CJB110}     CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_18RS21}     CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_2603}       CATTAGCTAG AwTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_A909}       CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_COH1}       CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_M732}       CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_M781}       CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_H36B}       CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_JM9130013}  CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
msa42193.2{176_1169NT}     CATTAGCTAG AaTTCGCGTT ATGGATAGAG AAATCAAAAA ATTTATTAGA
Consensus                  ********** *_******** ********** ********** **********



msa42193 .2{176_090; 
msa42193 . 2 { 176_CJB110 / 
msa42193 .2{176_18RS21; 
msa42193.2{l76_2603 
msa42193 . 2 { 176_A909 
msa42193 . 2 { 176_C0H1 



151 

GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 



ATAGTCAaAT 
ATAGTCAaAT 
ATAGTCAaAT 
ATAGTCAaAT 
ATAGTCAaAT 
ATAGTCAaAT 



CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 



GGtTGTGGAC 
GGtTGTGGAC 
GGtTGTGGAC 
GGtTGTGGAC 
GGtTGTGGAC 
GGtTGTGGAC 



200 

TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 




Table 49: Comparative Sequences related to SAG1502 



msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 
GAAAATCCAA 



ATAGTCAaAT 
ATAGTCAaAT 
ATAGTCAtAT 
ATAGTCAtAT 
ATAGTCAtAT 



CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCaATT 
CCTTTCtATT 



GGtTGTGGAC 
GGtTGTGGAC 
GGcTGTGGAC
GGcTGTGGAC
GGtTGTGGAC 



TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 
TTGATACAAG 



********** *******_** ******_*** **_******* ********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus



201 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 

GTTTGAAAGA 
********** 



GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 

GTCGATAATG 
********** 



GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 

GACAAATTAG 
********** 



GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 

GTGGTATAAC 
********** 



250 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 

CTTGATTTGC 
********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



251 

CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 
CAGAGGTTAT 



GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 
GGAGATAAGA 



AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 
AAATTATTTT 



TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 
TTGAAGAGCA 



300 

TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 
TGAAAGAGTT 



********** ********** ********** ********** ********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



301 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 

ACTAATATAG 
********** 



CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 

CAAAATCAGC 
********** 



CaTAGATGAA 

CaTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 

CcTAGATGAA 
*_******** 



ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 
ACTTGGACAC 



350 

gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
gGGAGGTAAA 
aGGAGGTAAA 



********** _********* 






msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus



351 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 

TCCCCAAAAT 
**********



GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 
GCCCCTTTTC 



TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TaATCGTGTC 
TgATCGTGTC 



AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 
AGAAGGTGTT 



400 

TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 
TTAATGTTTC 



********** *_******** ********** ********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



401 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 

TAAAAGAAGA 
********** 



TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
TGACGTAGAG 
********** 



ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 

ACTTTTCTTC 
********** 



ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 

ATATCCTGAC 
********** 



450 

AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
AAATTCATTT 
********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}



451 

AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 



TGGCACAATT 
TGGCACAATT 
TGGCACAATT 
TGGCACAATT 
TGGCACAATT 



TGATTTGTGT 
TGATTTGTGT 
TGATTTGTGT 
TGATTTGTGT 
TGATTTGTGT 



CAtAAGGAAA 
CAtAAGGAAA 
CAtAAGGAAA 
CAtAAGGAAA 
CAtAAGGAAA 



500 

TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 






msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 
AGCCAATTTA 



TGGCACAATT 
TGGCACAATT 
TGGCACAATT 
TGGCACAATT 
TGGCACAATT 
TGGCACAATT 



********** ********** 



TGATTTGTGT 

TGATTTGTGT 

TGATTTGTGT 

TGATTTGTGT 

TGATTTGTGT 

TGATTTGTGT 
********** 



CAtAAGGAAA 
CAtAAGGAAA 
CAtAAGGAAA 
CAgAAGGAAA 
CAgAAGGAAA 
CAgAAGGAAA 



TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 
TGATTAATAA 



**_******* ********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus



501 

AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 
AGGAAAGCAA 



CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 
CATGATACAG 



TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 
TAAAGTATAT 



GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 
GGATACAGAA 



550 

TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTtG 
TTTCAGTTgG 
TTTCAGTTtG 
TTTCAGTTtG 



********** ********** ********** ********** ********_* 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus



551 

GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 
GTATCACAGA 

GTATCACAGA 
********** 



TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAg 

TGGTCATGAa 

TGGTCATGAa 

TGGTCATGAa 
*********_ 



ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 

ATTGTGGATT 
********** 



TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 

TAGACCCTAA 
********** 



600 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 

ATTAAAGCAA 
********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



601 

ATAAATCTGA

ATAAATCTGA

ATAAATCTGA

ATAAATCTGA 

ATAAATCTGA 

ATAAATCTGA 

ATAAATCTGA 

ATAAATCTGA 

ATAAATCTGA 

ATAAATCTGA 

ATAAATCTGA 
********** 



TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 

TTAACTTTAC 
********** 



AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 

AGATGAGATG 
********** 



AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 

AGCAAATTTG 
**********



650 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 

AGTTAGGCAC 
********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus



651 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 

ACTTCGCTCT 
********** 



TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 

TTACTTCCAA 
********** 



CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 

CAATTCGTAA 
********** 



ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 

ATTTAATAAT 
********** 



700 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 

TGTTTAGGTG 
********** 



msa42193.2{176_090}
msa42193.2{176_CJB110}
msa42193.2{176_18RS21}
msa42193.2{176_2603}
msa42193.2{176_A909}
msa42193.2{176_COH1}
msa42193.2{176_M732}
msa42193.2{176_M781}
msa42193.2{176_H36B}
msa42193.2{176_JM9130013}
msa42193.2{176_1169NT}

Consensus 



701 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 

TGTACGAATA 
********** 



719 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 

TAAAGCATC 
********* 



SEQ ID NO: 4912 

STRAIN 2603 frame: 1 

KHPILNDQKSLAIVEQIEYDFDKFDNSEASFYATLARXRVMDREIKKFIRENPNSQILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE






FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA 

SEQ ID NO: 4913 

STRAIN 090 frame: 2 

NDQKSLAIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSIGCGLD
TRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSAIDETWTREVNPQNAPFLI
VSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTEFQFGI
TDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA

SEQ ID NO: 4914 

STRAIN A909 frame: 1 

KHPILNDQKSLAIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA

SEQ ID NO: 4915 

STRAIN H36B frame: 1 

KHPILNDQKSLAIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSHILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCQKEMINKGKQHDTVKYMDTE
FQLGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA



SEQ ID NO: 4916 

STRAIN 18RS21 frame: 3 

HPILNDQKSLAIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSIG
CGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQNA
PFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTEF
QFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA

SEQ ID NO: 4917 

STRAIN M732 frame: 1 

KHPILNDQKSLAIVEQIEYDLDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA 

SEQ ID NO: 4918 

STRAIN COH1 frame: 1 

KHPILNDQKSLAIVEQIEYDLDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA



SEQ ID NO: 4919 

STRAIN M781 frame: 1 

KHPILNDQKSLAIVEQIEYDLDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA

SEQ ID NO: 4920 

STRAIN CJB110 frame: 1 

KHPILNDQKSLAIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSQILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSAIDETWTREVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCHKEMINKGKQHDTVKYMDTE
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA

SEQ ID NO: 4921 

STRAIN 1169NT frame: 1 

KHPILNDQKSLAIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSHILSI
GCGLDTRFERVDNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTQEVNPQN
APFLIVSEGVLMFLKEDDVETFLHILTNSFSQFMAQFDLCQKEMINKGKQHDTVKYMDTE
FQFGITDGHEIVDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA



SEQ ID NO: 4922 

STRAIN JM9130013 frame: 2 

AIVEQIEYDFDKFDNSEASFYATLARIRVMDREIKKFIRENPNSHILSIGCGLDTRFERV
DNGQIRWYNLDLPEVMEIRKLFFEEHERVTNIAKSALDETWTREVNPQNAPFLIVSEGVL
MFLKEDDVETFLHILTNSFSQFMAQFDLCQKEMINKGKQHDTVKYMDTEFQFGITDGHEI
VDLDPKLKQINLINFTDEMSKFELGTLRSLLPTIRKFNNCLGVYEYKA

PRETTY of: /biotmp/msa42204.2{*} January 21, 2003 05:05



msa42204.2{176_H36B}
msa42204.2{176_JM9130013}
msa42204.2{176_090}
msa42204.2{176_18RS21}
msa42204.2{176_2603}
msa42204.2{176_A909}
msa42204.2{176_CJB110}



khpilndqks 

ndqks 

-hpilndqks 
khpilndqks 
khpilndqks 
khpilndqks 



lAIVEQIEYD
-AIVEQIEYD
lAIVEQIEYD
lAIVEQIEYD
lAIVEQIEYD
lAIVEQIEYD
lAIVEQIEYD



fDKFDNSEAS
fDKFDNSEAS
fDKFDNSEAS
fDKFDNSEAS 
fDKFDNSEAS 
fDKFDNSEAS 
fDKFDNSEAS 



FYATLARiRV 
FYATLARiRV 
FYATLARiRV 
FYATLARiRV 
FYATLARxRV 
FYATLARiRV 
FYATLARiRV 



50 

MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR



msa42204.2{176_COH1}
msa42204.2{176_M732}
msa42204.2{176_M781}
msa42204.2{176_1169NT}

Consensus 



khpilndqks 
khpilndqks 
khpilndqks 
khpilndqks 



lAIVEQIEYD
lAIVEQIEYD
lAIVEQIEYD
lAIVEQIEYD



lDKFDNSEAS
lDKFDNSEAS
lDKFDNSEAS
fDKFDNSEAS



FYATLARiRV
FYATLARiRV
FYATLARiRV
FYATLARiRV



MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR 
MDREIKKFIR 



_********* _********* *******_** ********** 



msa42204.2{176_H36B}
msa42204.2{176_JM9130013}
msa42204.2{176_090}
msa42204.2{176_18RS21}
msa42204.2{176_2603}
msa42204.2{176_A909}
msa42204.2{176_CJB110}
msa42204.2{176_COH1}
msa42204.2{176_M732}
msa42204.2{176_M781}
msa42204.2{176_1169NT}

Consensus



51 

ENPNShILSI
ENPNShILSI
ENPNSqILSI
ENPNSqILSI
ENPNSqILSI
ENPNSqILSI
ENPNSqILSI
ENPNSqILSI
ENPNSqILSI
ENPNSqILSI
ENPNShILSI



GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 

GCGLDTRFER 
********** 



VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 

VDNGQIRWYN 
********** 



LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 

LDLPEVMEIR 
********** 



100 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 

KLFFEEHERV 
********** 



msa42204.2{176_H36B}
msa42204.2{176_JM9130013}
msa42204.2{176_090}
msa42204.2{176_18RS21}
msa42204.2{176_2603}
msa42204.2{176_A909}
msa42204.2{176_CJB110}
msa42204.2{176_COH1}
msa42204.2{176_M732}
msa42204.2{176_M781}
msa42204.2{176_1169NT}

Consensus



101 

TNIAKSAlDE
TNIAKSAlDE
TNIAKSAiDE
TNIAKSAlDE
TNIAKSAlDE
TNIAKSAlDE
TNIAKSAiDE
TNIAKSAlDE
TNIAKSAlDE
TNIAKSAlDE
TNIAKSAlDE



TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTrEVNPQN 
TWTqEVNPQN 



APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 
APFLIVSEGV 



*******_** ***_****** ********** 



LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 

LMFLKEDDVE 
********** 



150 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 

TFLHILTNSF 
********** 



msa42204.2{176_H36B}
msa42204.2{176_JM9130013}
msa42204.2{176_090}
msa42204.2{176_18RS21}
msa42204.2{176_2603}
msa42204.2{176_A909}
msa42204.2{176_CJB110}
msa42204.2{176_COH1}
msa42204.2{176_M732}
msa42204.2{176_M781}
msa42204.2{176_1169NT}

Consensus



151 

SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 
SQFMAQFDLC 



qKEMINKGKQ
qKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
hKEMINKGKQ
qKEMINKGKQ



********** _********* 



HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 

HDTVKYMDTE 
********** 



200 

IVDLDPKLKQ 

IVDLDPKLKQ

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 

IVDLDPKLKQ 
**_******* **********



FQlGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE
FQfGITDGHE



msa42204.2{176_H36B}
msa42204.2{176_JM9130013}
msa42204.2{176_090}
msa42204.2{176_18RS21}
msa42204.2{176_2603}
msa42204.2{176_A909}
msa42204.2{176_CJB110}
msa42204.2{176_COH1}
msa42204.2{176_M732}
msa42204.2{176_M781}
msa42204.2{176_1169NT}

Consensus



201 

INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
INLINFTDEM 
********** 



SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 
SKFELGTLRS 



LLPTIRKFNN 
LLPTIRKFNN
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 
LLPTIRKFNN 



********** ********** 



239 

CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
CLGVYEYKA 
*********

Table 50: Comparative Sequences relating to SAG 1024



SEQ ID NO. 5001 
STRAIN 2603 

ATGAAAAAACAAAAACTATTACTGCTTATTGGAGGCTTATTAATAATGATAATGATGACA
GCATGTAAGGATTCAAAAATCCCAGAAAACCGCACAAAGGAAGAGTACCAAGCTGAACAA 
AATTTTAAAC CGTTTTTTGAGTTTTTAGCACAAAAAG AT AAAGATl^ CAAAATACAA 
AAATACTTACTATTAGTATCGGATTCAGGTGATGCATTAGATTTAGAATATTTCTAT^ 
ATTGAAGATTTAAAAAAAAATAAGGATTTAGGGAAGT^ 

GAAAAGCCGGGTGGCTATAATGAGTTAGAAAATAAAGAGGTCCCATTTGAATATTTTAAA 

AATAATATAGTTTATCCAAAAGGAAAACCGAATATTACATTTGATGACrTTATTA 

GCAATGGATACTAAAGAATTAAAAGAATTAAAAA^ 

AAACATC CGGAAACTGAGTTGAAAGATAT AACATATG AATTG CCG ACACAGTCGAAG CTT 
ATTAAAAAA 



SEQ ID NO. 5002 

STRAIN 090 

TAAGG ATT CAAAAAT CC CAG AAAACCG CACAAAG 

GAAGAGTACCAAGCTGAACAAAATTTTAAACrGTriTTTGAGTTTTTAGC 
ACAAAAATATAAAGATTTGAACAAAATACAAAAATACTTACTATTAGTAT 
CGGATTCAGGTG ATG CATTAG ATTTAGAATATTT CTATAGTATT CAAGAT 
TTAAAAAAAAATAAGGATTTAGGGAAGTTTGAAACAAGAAAAAGTCAAAT 
AGAAAAG CCGGGTG G CT ATAATGAGTTAG AAAATAAAGAGGT CC CATTTG 
AATATTTTAAAAATAATATAGTTTATCCAAAAGGAA^CCGAATATTACA 
TTTGATGACTITT'ATTATCGGAGCAATGGATACTAAAGAATTAAAA 
AAAAGTAAAAAGTTATTTATTAAAACATC CGG AAACTGAGTTGAA^GATA 
TAACATATGAATTGC CGACACAGT CGAAG CTTATTAAAAAA 



SEQ ID NO. 5003 

STRAIN 18RS21 

TAAGGATTGAAAAATCCCAGAAAACCGCACAAAGGAAG 

AGTACCAAGCTGAACAAAATTTTAAACCGTTTTTT CACAA 

AAAGATAAAGATTTGAGCAAAATACy^AAATACTTACTATTAGTATCGG^ 

TTCAGGTGATGCATTAGATTTAGAATATTTCTATAGTATTCAAGATT^ 

AAAAAAATAAGGATTTAGGG AAGTTTGAAACAAGAAAAAGT CAAATAGAA 

AAGCCGGGTGGCTATAATGAGTTAGAAAATAAAGAGGTC C CATTTGAAT A 

TTTTAAAAATAATATAGTTTATCCAAAAGGAAAA.CCGAATATTACATTTG 

ATGACTTTATTATCGGAG CAATGG ATACTAAAGAATT AAAAGAATTAAAA 

GAATTAAAAAAATTAAAAGTAAAAAGTTATTTATTAAAACATCCGGAAAC 

TGAGTTGAAAGATATAACATATGAATTGCCGGCACAGTCGAAGCTTATTA 

AAAAA 



PRETTY of: /biotmp/msa212269.2{*} February 10, 2003 05:07 ..

                         1                                                   50
msa212269.2{184_090}
msa212269.2{184_2603}    atgaaaaaac aaaaactatt actgcttatt ggaggcttat taataatgat
msa212269.2{184_18RS21}
Consensus                ********** ********** ********** ********** **********



                         51                                                 100
msa212269.2{184_090}                     TAAGG ATTCAAAAAT CCCAGAAAAC CGCACAAAGG
msa212269.2{184_2603}    aatgatgaca gcatgTAAGG ATTCAAAAAT CCCAGAAAAC CGCACAAAGG
msa212269.2{184_18RS21}                  TAAGG ATTCAAAAAT CCCAGAAAAC CGCACAAAGG
Consensus                ********** ********** ********** ********** **********



                         101                                                150
msa212269.2{184_090}     AAGAGTACCA AGCTGAACAA AATTTTAAAC tGTTTTTTGA GTTTTTAGCA
msa212269.2{184_2603}    AAGAGTACCA AGCTGAACAA AATTTTAAAC cGTTTTTTGA GTTTTTAGCA
msa212269.2{184_18RS21}  AAGAGTACCA AGCTGAACAA AATTTTAAAC cGTTTTTTGA GTTTTTAGCA
Consensus                ********** ********** ********** _********* **********



                         151                                                200
msa212269.2{184_090}     CAAAAAtATA AAGATTTGAa CAAAATACAA AAATACTTAC TATTAGTATC
msa212269.2{184_2603}    CAAAAAgATA AAGATTTGAg CAAAATACAA AAATACTTAC TATTAGTATC
msa212269.2{184_18RS21}  CAAAAAgATA AAGATTTGAg CAAAATACAA AAATACTTAC TATTAGTATC
Consensus                ******_*** *********_ ********** ********** **********



                         201                                                250
msa212269.2{184_090}     GGATTCAGGT GATGCATTAG ATTTAGAATA TTTCTATAGT ATTCAAGATT
msa212269.2{184_2603}    GGATTCAGGT GATGCATTAG ATTTAGAATA TTTCTATAGT ATTCAAGATT
msa212269.2{184_18RS21}  GGATTCAGGT GATGCATTAG ATTTAGAATA TTTCTATAGT ATTCAAGATT
Consensus                ********** ********** ********** ********** **********



                         251                                                300
msa212269.2{184_090}     TAAAAAAAAA TAAGGATTTA GGGAAGTTTG AAACAAGAAA AAGTCAAATA
msa212269.2{184_2603}    TAAAAAAAAA TAAGGATTTA GGGAAGTTTG AAACAAGAAA AAGTCAAATA
msa212269.2{184_18RS21}  TAAAAAAAAA TAAGGATTTA GGGAAGTTTG AAACAAGAAA AAGTCAAATA
Consensus                ********** ********** ********** ********** **********



                         301                                                350
msa212269.2{184_090}     GAAAAGCCGG GTGGCTATAA TGAGTTAGAA AATAAAGAGG TCCCATTTGA
msa212269.2{184_2603}    GAAAAGCCGG GTGGCTATAA TGAGTTAGAA AATAAAGAGG TCCCATTTGA
msa212269.2{184_18RS21}  GAAAAGCCGG GTGGCTATAA TGAGTTAGAA AATAAAGAGG TCCCATTTGA
Consensus                ********** ********** ********** ********** **********






                         351                                                400
msa212269.2{184_090}     ATATTTTAAA AATAATATAG TTTATCCAAA AGGAAAACCG AATATTACAT
msa212269.2{184_2603}    ATATTTTAAA AATAATATAG TTTATCCAAA AGGAAAACCG AATATTACAT
msa212269.2{184_18RS21}  ATATTTTAAA AATAATATAG TTTATCCAAA AGGAAAACCG AATATTACAT
Consensus                ********** ********** ********** ********** **********

                         401                                                450
msa212269.2{184_090}     TTGATGACTT TATTATCGGA GCAATGGATA CT
msa212269.2{184_2603}    TTGATGACTT TATTATCGGA GCAATGGATA CT          aaagaatta
msa212269.2{184_18RS21}  TTGATGACTT TATTATCGGA GCAATGGATA CTaaagaatt aaaagaatta
Consensus                ********** ********** ********** **

                         451                                                500
msa212269.2{184_090}     AAAGAATTAA AAAAATTAAA AGTAAAAAGT TATTTATTAA AACATCCGGA
msa212269.2{184_2603}    AAAGAATTAA AAAAATTAAA AGTAAAAAGT TATTTATTAA AACATCCGGA
msa212269.2{184_18RS21}  AAAGAATTAA AAAAATTAAA AGTAAAAAGT TATTTATTAA AACATCCGGA
Consensus                ********** ********** ********** ********** **********

                         501                                                550
msa212269.2{184_090}     AACTGAGTTG AAAGATATAA CATATGAATT GCCGaCACAG TCGAAGCTTA
msa212269.2{184_2603}    AACTGAGTTG AAAGATATAA CATATGAATT GCCGaCACAG TCGAAGCTTA
msa212269.2{184_18RS21}  AACTGAGTTG AAAGATATAA CATATGAATT GCCGgCACAG TCGAAGCTTA
Consensus                ********** ********** ********** ****_***** **********

                         551
msa212269.2{184_090}     TTAAAAAA
msa212269.2{184_2603}    TTAAAAAA
msa212269.2{184_18RS21}  TTAAAAAA
Consensus                ********



SEQ ID NO. 5004 

STRAIN 2603 frame: 1 

MKKQKLLLLIGGLLIMIMMTACKDSKIPENRTKEEYQAEQNFKPFFEFLAQKDKDLSKIQ
KYLLLVSDSGDALDLEYFYSIQDLKKNKDLGKFETRKSQIEKPGGYNELENKEVPFEYFK
NNIVYPKGKPNITFDDFIIGAMDTKELKELKKLKVKSYLLKHPETELKDITYELPTQSKL
IKK 

SEQ ID NO. 5005 

STRAIN 090 frame: 2 

KDSKIPENRTKEEYQAEQNFKLFFEFLAQKYKDLNKIQKYLLLVSDSGDALDLEYFYSIQ
DLKKNKDLGKFETRKSQIEKPGGYNELENKEVPFEYFKNNIVYPKGKPNITFDDFIIGAM
DTKELKKLKVKSYLLKHPETELKDITYELPTQSKLIKK

SEQ ID NO. 5006

STRAIN 18RS21 frame: 2 

KDSKIPENRTKEEYQAEQNFKPFFEFLAQKDKDLSKIQKYLLLVSDSGDALDLEYFYSIQ
DLKKNKDLGKFETRKSQIEKPGGYNELENKEVPFEYFKNNIVYPKGKPNITFDDFIIGAM
DTKELKELKELKKLKVKSYLLKHPETELKDITYELPAQSKLIKK



PRETTY of: /biotmp/msa212547.2{*} February 10, 2003 05:11



                                                                             50
msa212547.2{184_18RS21}                          KDSKIPEN RTKEEYQAEQ NFKpFFEFLA
msa212547.2{184_2603}    mkkqklllli ggllimimmt acKDSKIPEN RTKEEYQAEQ NFKpFFEFLA
msa212547.2{184_090}                             KDSKIPEN RTKEEYQAEQ NFKlFFEFLA
Consensus                ********** ********** ********** ********** ***_******

                         51                                                 100
msa212547.2{184_18RS21}  QKdKDLsKIQ KYLLLVSDSG DALDLEYFYS IQDLKKNKDL GKFETRKSQI
msa212547.2{184_2603}    QKdKDLsKIQ KYLLLVSDSG DALDLEYFYS IQDLKKNKDL GKFETRKSQI
msa212547.2{184_090}     QKyKDLnKIQ KYLLLVSDSG DALDLEYFYS IQDLKKNKDL GKFETRKSQI
Consensus                **_***_*** ********** ********** ********** **********

                         101                                                150
msa212547.2{184_18RS21}  EKPGGYNELE NKEVPFEYFK NNIVYPKGKP NITFDDFIIG AMDTkelkel
msa212547.2{184_2603}    EKPGGYNELE NKEVPFEYFK NNIVYPKGKP NITFDDFIIG AMDT...kel
msa212547.2{184_090}     EKPGGYNELE NKEVPFEYFK NNIVYPKGKP NITFDDFIIG AMDT
Consensus                ********** ********** ********** ********** ****_

                         151                                  186
msa212547.2{184_18RS21}  KELKKLKVKS YLLKHPETEL KDITYELPaQ SKLIKK
msa212547.2{184_2603}    KELKKLKVKS YLLKHPETEL KDITYELPtQ SKLIKK
msa212547.2{184_090}     KELKKLKVKS YLLKHPETEL KDITYELPtQ SKLIKK
Consensus                ********** ********** ********_*



SEQ ID NO. 5101
STRAIN 2603 

ttgaataataaaggtgtcggtggcgatggtgtccaaatttatcaatacta 

tatcaaaatggacaacaataaaccttacttaagtcccaaagataagacta 

ctgtagagaagttagaagatcgctggaaaaaaattactttcaaagttcag 

gatactggcattggtttgaaagacgtttatcttcaatctgttaagtatgt 

tggtggtggcaataataatttagaccttatcacacctccaggatttaaaa 

aagaagataaaaaagttgaaaaaccaaaattagaccgtccaccaggaatt 

gatttaccagcaccaacttcaatgagaagttttgattattcaaccccacc 

gggaactaagccaagcaaacccaaagatagtttatcaactcctccaggtt 

tcccagatttaaacacgccgccggatgaagcaccaaaggatagtaaaaaa 

gacgctattgaagataaatcaggagcaattaaatatgctaagtctcttca 

acttagctttgttgatggccctattttagctagcaaagtaaatggcaaaa 

tattacaagtcgaatctgatggcaaattagtcattcctagaaatgctttg 

tcagctaatcaatttgatgacactagtcttaaaatttatcgtaataataa 

tcgcaataaagaaattactatcacaacagattattttgcagatacaaaat 

atgtcaatatcacagcggttgactatttgagcaatactacttttgagcaa 

ttagctactggtgaaacagtagattaccatgccattgtattttcaagctt 

tgctgctattaaagacaagggtggtaagatttatgttaacgataaattgc 

aagaaacttctcgtatagcgcttaaagataaatctgttaagattggtatt 

gaattaccaaatgatgtcagacatattgatagtttatctgttcgtcgttt

gaatgaggttaaaactgttgataatatcttgaaaaatgatgaacaagaca 

ttaatctcagcaaaacttaccaattaaaatacaacccgacaaatcgtcgt 

ctagagtttactattaataacattaactcaagttcagaaatcatgaccac 

tttcaaagatggaaagatgccagaattggttgaacaaaaagatgtttctt 

tggatataaacgatatggacatgagtaagtttaaaactattcgacttgga 

cgaaaggattctgaatttaagggacaacttattgcaaaaactggaacagt 

tgaattagatatgtttttcaaacaatctcaagacccagcttcaattatta 

aaaaaatataccttatccaaaatggtgttccaaatgaattgaaaaaattt 

gactctagttttggtttaactgaaagtcagatagatggatactatattta 

taaagatgcaattaaccttaaatttaaattaaccagtggtgcaagtctta 

aagttgtttataaagggcaagaagatccatatagtcatcagaaagaagat 

atgactaaaaaaggtgaacagctcagtcattcaactcaagccaatgaaaa 

tacagcaaaagtaacctttgctaatattgactggtcacattatagtaagg 

ttactgtgaatggaaaagaagttgttaaaggtagtgagttacctttaact 

aaaggatggacaacatttgtattacataaaacagaaaattcattaaatgt 

taaaagtttgattatggagacgggtagtgtaagtaagaaagttcaacaac 

ttcctttaagtcctagattatctaaaaataagcatatgagggatatgcta 

cttactatgcaaaaagattcagcgtattacgaaacaagtgacagtctagt 

ccttcgaattaatctcactgcagatactaaacttaattttaatgctgtta 

aaggagcgagtgctcttactgaaaatatgatgatgagacagtttgcagtt 

gctggaccacaagatgatcctgttagtgaacataaatacccatcagtatt 

tctcttaactcctgccttattggaaactgctagtgaggcaactctaaatg 

gtaaggaaatcacagcatctggtattatcggtcacatcaaggatggtgat 

aaaagcaagcatgttgaagtcaaaatggtgaatgaaaatggagacatgct 

aggaacccctgttattattcaaggtaaagacttgactaatcgaacaaaac 

cattaatgagtggacgtagagtactttatgccggtaaacaatatgagttc 

cgggctaaattaccacttagtcgttttaacacttggattagggttgaagt

ggtaacagaagcaggagagaaagcaagtattgttcgtcgcatgttctttg 

accaatcagttccagagcttaacacagcagttgctaaacgtgatttgact 

tctgatactgctcttatccacatcgttgccaaagatgactctctaaaact 

aaaattatatcaagatgattcattacttgaatctgttgataaaaccggtc 

tttatagttttagaaatggtgtagaaatcactaaagatatgacagtacca 

ctagaatttggagataatattattaagttatctgctgttgacttatcaaa 

ttatcgtcgtaatgagacccttcatatctatagaaaccgttttgatgtta 

aagcaagccaaatgacagctgacaaaggagctaaagtaactgtggatatg 

ttgatgaagcacttagttgttccagaaatggcaggagcttatacattaac 

aatcgacgaagctccaaacacaaatgaatcaggaatgttaacaaacgcta 

aagtatcgattcattatgtaaatggtggtgttgataaagttgatgttccg 

attaaagtagttgacttagaagctattcgtaaagctgaagaagcacgtaa 

agctgaagaagcacgtaaagctgaagaagcacgtaaagctgaagagggac 

ataaaacccaagaagcacctatagttgaagaaggctacaaggttaataac 

gttcatcaaactgatactacagttaaagcgtctgatttaccaaagactaa 

gacagtttccgcagttcatatggctagaacagacaataaacagataactt 

cacatcagacacatgttgaaaaacaaattaaaaatacattgccatccact 

ggtgacagcaaacgtggttattatatcactggaatggctatcgttatgct 

gagtgtattatttagtttagctaaaaagtttaaaagcaaatat 

SEQ ID NO. 5102
STRAIN A909 

TTGAATAATAAAGGTGTCGGTGGCGAT 

GGTGTC CAAATTTAT CAATACT ATAT CAAAATGG ACAACAATAAACCTT A 
CTTAAGTCC GAAAGATAAGACTACTGTAG AGAAGTTAGAAGATCG CTGGA 
AAAAAATTACTTT CAAAGTT CAGGATACTGG CAT TGGTTT GAAAGACGTT 
TATCrTCAATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAGACCT 
TATCACACCrr CCAGGATTT AAAAAAGAAG ATAAAAAAGTTGAAAAAC CAA 
AATT AGACCGTC CACCAGGAATTG ATTTACCa C CACCAACTT CAATG AG A 
AGTTTTGATTATTCAAC CC CAC CGGGAACTAAGC CAAGGAAACC CAAAGA 
TAGTTTATCAACT C CTC CAGGTTT C CCAGATTTAAACACG C CGC CGG ATG 
AAG CACTAAAGG AT AGTAAAAAAGACG CT ATTGAAGATAAAT CAGGAGCA 
ATTAAATATG CT AAGTCTCTTCAACTT AGCTTTGT TGATGAC C CTATTTT 
AGCrrAGCAAAGTAAATGGCAAAATATTACAAGTCGAATCTGATGGCAAAT 
TAGT CATTCCTAGAAATG CTTTGT CAG CT AATCAATTTGATG ACACTAGT 






WO 2004/018646 
Table 51: Comparative Sequences relating to SAG0677



CTTAAAATTT AT CGT AATAATAAT CGCAAT AAAGAAATT ACT AT CACAAC 
AGATTATTTTGCAGATACAAAATATGTCAATATCACAGCGGTTGACTATT 
TGAGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAACAGTAGATTAC 
CATGCCATTGTATTTTCAAGCTTTGCTGCTATTAaAGACAAGGGTGGTAA 
GATTTATGTTAACG ATAAATTG CAAGAA^CTT CT CGT ATAG C G CTTAAAG 
ATAAATCTGTTAAG ATTGGT ATTG AATTAC CAAATGATGT CAGACATATT 
GATAGTTTAT CTGTTCGTC GTTTGAATGAGGTTAAAACTGTTGATAATAT 
CTTG AAAAA.TGATG AACAAG ACATTAATCT CAG CAAAACTT ACCAATTAA 
AAT ACAAC C CG ACAAAT CGT CGT CTAGtAGTTT ACTATTAATAACATTAAC 
TCAAGTT CAG AAAT CATGAC CACTTT CAAAGATGGAAAG ATG CCAGAATT 
GGTTGAa CAAAAAGATGTTT CTTTGGATATAa aCG ATATGGACATGAGT A 
AGTTTAAAACTATTCGACITGGACGAAAGGATTCTGAATTTAAGGGACAA 
CTTATTGCAAAAACTGGA^CAGTTGAATTAGATATGTTTTTCAAACAATC 
TCAAGACCCAGCTTCAATTATTAAAAAAATATACCTTATCCAAAATGGTG 
TTCaAAATGAATTGAAAAAATTTGACTCTAGTTTTGCTTTAACT 
CAGATAGATGGATACTATATTTATAAAGATGC1AATTAACCTTAAATTTAA 
ATTAACCAGTGGTG CAAGT CTT AAAGTTGTTT ATAAAGGGCAAG AAG AT C 
CATATAGT CAT CAGAAAGAAGATATGACTAAAAAAGGTG AACAG CTCAGT 
CATTCAACTCAAGCCAATGAAAATACAGCAA^ 

TGACTGGTCACATTATAGTAAGGTTACTCTGAATGGAAAAGAAGTTGGTA 
AAGGTAGTGAGTTACCTTTAACTAAAGGATGCACAAf^TTTGTATTACAT 
AAAACAGAAAATTC^TTAAATGTTAAAAGTTTGATTATGGAGACGGGTAG 
TGTAAGTAAGAAAGTTCAACAACTT CCTTTAAGT C CT AGATTATCTAAAA 
ATAAG CATATGAGGGATATGCTACTTACTATGCAAAAAGATTCAG CGTAT 
TACGAaaCAAGTGACAGTCTAGTCCnrTCGAATTAATCTCACTGCAGATAC 
TAAACTT AATTTTAATG CTGTT AAAGGAG CGAGTG CT CTTACTGAAAATA 
TGATGATGAGACAGTTTG CAGTTG CTGGAC CACAAGATGATCCTGTTAGT 
GAACATAAA.TACCCA.TCAGTATTTCTCTTAACTCCTG CCTTATTGGAAAC 
TGCTAGTGAGGCAACTCTaAATGGTAAGGAAAT^ 

TCGGTCACATCAAGGATGGTGATAAAAGCAAGCATGTTGAAGTCAAAATG 
GTGAATGAAAATGGAGACATGCTAGGAAC C CCTGTTATTATTCAAGGTAA 
AGACTTGACTAATCGAACAAAACCATTAATGAGTGGACGTAGAGTACTTT 
ATGCCGGTAAACAATATGAGTT CCGGG CTAAATTAC CACTTAGT CGTTTT 
AACACTTGGATTAGGGTTGAAGTGGTAACAGAAG 

TATTGTT CGT CG CATGTT CTTTGAC CAAT CAG t TCCAGAG CTTAACACAG 
CAGTTGCTAAACGTGATTTGACTTCTGATACTGCTCrTATCCACATCGTT 
GC CAAAGATGACT CT CTAAAACTAAAATTATAT CAAGATG ATTCATTACT 
TGAAT CTGTTGATAAAAC CGGTCTTTATAGTTTTAGAAATGGTGTAGAAA 
TCACTAAAGATATGACAGTACCACTAGAATTTGGAGATAATATTATTAAG 
TTATCTG CTGTTGACTTAT CAAATTAT CGTCGTAATGAGACCCTT CATAT 
CTATAGAAACCX^TTTTGATGTTA^GCAAGCCAAATGAa^GCTGACAAAG 
GAGCTAAAGT AACTGTGGATATGTTGATGAAG CACTTAGTTGTTC CAGAA 
ATGGCAGGAGCTTATACATTAACAATCGACGAAGAT C CAAACACAAATGA 
AT CAGCAATGTTAACAAACGCTAAAGT AT CGATT CATTATGTAAATGGTG 
GTGTTGATAAAGTTGATGTT C CGATTAAAGTAGTTGACTTAG AAG CTATT 
CGTAAAGCTGAAGAAGCACATAAAGCTGACGAAGCACGTAAAGCTGAAGA 
AG CACGTAAAGCTGAAGAAGCACGTAAAG CTGAAGAAG CACGTAAAGCTG 
AAGAGGGACATaAAACCCAAGAAG CAC CTATAGTTGAAGAAGGCTACAAG 
GTTAATAACGTTCAT CAAACTGATACTACAGTTAAAG CGTCTGATTTAC C 
AAAGACTAAGACAGTTTCCG CAGTT CATATGG CT AGAACAGACAATAAAC 
AGATAACTT CACATCAGACA(TATGTTGAAAAA.CAAATTAAAAATA 

SEQ ID NO. 5103 
STRAIN H36B

TGGTGTCCAAATTTATCAATACTATATCAAAATGGACAACAATAAACCTT 

ACTT AAGTCC CAAAGAT AAGACTACTGTAGAGAAGTTAG a aGAT CGCTGG 

AAAAAAATTACTTTCAAAGTTCAGGATACTGGCATTGGT^^ 

TTAT CTT CAAT CTGTT AAGTAT GTTGGTGGTGG CAATAATAATTTAGAC C 

TTATCACACCJTCCAGGATTTAAAAAAGAAGATAAAAAAGTTGA 

AAATTAGAC CGTCCACCAGGAATTG ATTTACCAG CACCAACTT CAATGAG 

AAGTTTTGATTATTCAACC C CAC CGGGAACTAAG C CAAG CAAA.C CCAAAG 

ATAGTTTATCAACTCCTCCAGGTTTCCCAGATTTAAACACGCCGCCGGAT 

GAAG CACTAAAGGATAGTAAAAAAG ACG CTATTGAAGAT AAAT CAGG AG C 

AATTAAATATG CTAAGT CTCTTCAACTTAGCTTTGTTGATGAC C CTATTT 

TAG CTAGCAAAGTAAATGG CAAAATATTACAAGT CGAAT CTGATGGCAAA 

TTAGTCATT CCTAGAAATGCTTTGTCAGCTAAT CAATTTGATGACACTAG 

TCTTAAAATTTATCGTAATAAT AAT CG CAATAAAGAAATTa cTAT CACAA 

CAGATTATTTTG CAG ATACAAAATATGTCAATAT CACAG CGGTTG ACTAT 

TTGAGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAaCAGTA 

CCATGCCATTGTAtTTTCAAGCTTTGCTGCTATTAAAGACAAG^ 

AGATTTATGTCAACGATAAATTGCAAGAAACTT CTCGTAT AGCG CTTAAA 

GATAAAT CTGTTAAG ATTGGTATTGAATTAC CAAATG ATGTCAGACATAT 

TGATAGTTTATCTGTTCGTCGTTTGAATGAGCTTAAAACTGTTGATAATA 

T CTTGAAAAATGATGAACAAGACATTAAT CTCAG CAAAACTTACCAATTA 

AAATACAACCCGACAAATCGTCGTCTAGAGTTTACTATTAATAACATTAA 

CTCAAGTTCAGAAAT CATGACCACTTT CAAAG ATGGAAAGATG C CAgAAT 

TGGTTGAACAAAAAGATGTTTCTTTGGATATAAACGATA 

AAGTTTAAAACrATTOlACTTGGACGAA^ 

ACTTATTGC!AAA^CTG<XAA.CAGTTGAATTAGATATC 

CT CAAGACC CAG CTT CAATTATTAAAAAAATATAC CTTATCCAAAATGGT 

GTTCCAAATGAATTGAAAAAATTTGACTCTAGTTTTC 

T CAG ATAGAT GGAT ACTAT ATTTATAAAG ATG CAATTAAC CTTAAATTTA 

AATTAAC CAGTGGTG CAAGTCTTAAAGTTGTTTATAAAG GGCAAG AAGAT 
CCATATAG t CAT CAGAAAGAAG AT ATG ACTAAAAAAG GTG AACAG CT CAG
TCATTCAACT CAAG C CAATGAAAATACAG CAAAAGTAAC CTTTG CTAATA 
TTGACIGGTCACATTATAGTAA.GGTTACTGTGAATGGAAAAGAAGTTGGT 
AAAGGTAGTGAGTTACCITTAACTAAAGGATGGACAACATTTGTATTACA 
TAAAACAG AAAATT CATTAAATGTT AAAAGTTTG ATTATGGAGACGG GTA 
GTGTAAGTAAGAAAGTT CAACAACTTC CTTTAAGT CCTAG ATTAT CT AAA 
AATAAGCAT ATG AGGGATATG CTACTTACTATGCAAAAAGATTCAG CGTA 
TT ACGAAACAAGTGACAGT CTAGTCCT TCG AATT AAT CT CACTG CAGAT A 
CTAAACTTAATTTTAATGCTGTTAAAGGAGCGAGTGCTCTTA 
ATGATGATG AGACAGTTTG CAGTTG CTGG ACCACAAG ATGAT C CTGTTAG 
TGAACAT AAATAC C CATCAGTATTT CTCTT AACT CCTG C CTTATT GG AAA. 
CTG CT AGTG AGG C aACTCTAAATGGTAAGG AAAT CACAG C AT CTGGTATT 
AT CGGTCACATCAAGGATGG t G ATAAAAG CAAGCATGTTG AAGT CAAAAT 
GGTGAATGAAAATGGAGACATGCTAGGAACCCCTGTTATTATTCAAGGTA 
AAGACTTGACTAAT CGAACAAAAC CATTAATG AGTGGACGT AGAGTACTT 
TATGCCGGTAAACAATATGAGTTCCGGGCTAAATTACCACTTAGTCGTTT 
TAAC a CTTGGATTAGGGTTG AAGTGGTAACAG AAG CAGG AGAGAAAG CAA 
GTATTGTTCGTCGCATGTT CTTTG ACCAATCAGTTCCAGAGCITAA(^ 
GCAGTTGCTAAACGTGATTTGACITCTGATACTGCT 

TGCCAAAGATGACTCTCTAAAACTAAAATTATAT CAAGATGATT CATTAC 
TTGAATCTGTTGATAAAACCGGTCTTTATAGTTTTAGAAATGGTGTAGAA 
ATCACTAAAGATATGACAGTACCACTAGAATTTGGAGATAATATTACTAA 
GTTAT CTGCTGTTGACITATCAAATTATCGT CGTAATGAGAC CCTTCATA 
T CTATAGAAACCGTTTTGATGTTAAAG CAAGCCAAATG ACAG CTGACAAA 
GGAGCTAAAGTAACTGTGGATATGTTGATGAAGGACTTAGTTGTTCCAGA 
AATGGa^GGAGCTTATACATTAACAATCGACGAAGCTCCAAACACAAATG 
AATCAGGAATGTTAACAAACGCTAAAGTATCGATTCATTATGTAAATGGT 
GGTGTTGATAAAG 1 1 GATGTTCCX3ATTAAAGTAGTTG ACTTAGAAGCTAT 
T CGTAAAGCTGAAGAAG CACATAAAGCTG ACG AAG CACGTAAAG CTGAAG 
AAGCACGTAAAGCTGACaAAGCA.CATAAAGCTGAAGAAGTACGTAAAGCT 
GAAGAAG CACATAAAGT CGAAGAAG CACGTAAAGCTG AAGAGGGACATAA 
AACCCAAGAAGCACCTATAGTTGAAGAAGG CTACAAGGTT AATAACGTT C 
ATCAAACTGATACTACAGTTAAAGCGTCTGATTTACCAAAGACTAAGACA 
GTTT C CG CAGTTCATATGG CTAGAACAGACAATAAACAG ATAACTT CACA 
TCAGACACATG 

SEQ ID NO. 5104 
STRAIN 18RS21 

TTGAATAATAAAGGTGTCGGTGGCGATGGTGTCCAA 

ATTT ATCAATACTATAT CAAAATGGACAACAATAAAC CTT ACTTAAGTCC 
CAAAGATAAGACTACTGTAGAGAAGTTAGAAGATCGCTGGAAAAAAATTA 
CTTT CAAAGTTCAGGATACTGG CATTGGTTTGAAAGACG 
TCTGTTAAGTATGTTGK3TGGTGGGAATAATAATTTAG AC CTT AT CACAC C 
T C CAGGATTT AAAAAAGAAGATAAAAAAGTTGAAAAA.C CAAAATT AG ACC 
GT CCACCAGGAATTGATTT ACCAG CAC CAACTTCAATGAG AAGTTTT GAT 
TATTCAACC C CAC CGGGAACTAAG CCAAG CAAAC C CAAAGAT AGTTTAT C 
AACTCCTCCAGGTTTCCCAGATTTAAACACGCCGCCGGaTGAAGCACC^ 
AGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGGAGCAATTAAATAT 
GCTAAGTCTCTTCAACrTAGCTTT'GTTGATGACCOT 

AGTAAATGG CAAAATATTACAAGT CGAAT CTGATGGCAAATTAGTCATTC 
CT AGAAATG CTTTGT CAG CTAATCAATTTGATGACACTAGTCTTAAAATT 
TATCGTAATAATAATCGCAATAAAGAAATTACTATCACAACAGATTATTT 
TG CAGATACAAAATATGTCAAT AT CACAG CGGTTGACTATTTGAG CAATA 
CTACTI^FTGAGCAATTAGCTACTGGTGAAAC^ 
GTATTTTCAAGCTTTGCTGCTATTAA^ 

TAACGAT AAATTG CTVAGAaACTTCTCGTATAGCG CTTAAAGATAAAT CTG 
TTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGATAGTTT 
TCTGTTCGTCGTTTGAATGAGGTTAAAACTGTTGATAATATCTTGAAAAA 
TGATGAAGAAGACATTAATCTCAGCAAaACTT^ 

CGACAAATCGT CGT CTAGAGTTTACTATTAAT AACATTAACT CAAGTT CA 
GAAAT CATGACCACTTTCAAAG ATGGAAAGATGC CAG AATTGGTTGAACA 
AAAAGATGTTTCTTTGGATATaAACGATATGGACATGAGTAAGTTTAAAA 
CTATT CG ACITGGACGAAAGGATT CTGAATTTAAGGGACAACTTAT^ 
AAAACTGGAACAGTTGAATTAGATATGTTTTT CAAACAAT CT CAAG ACC C 
AG CTT CAATTATTAAAAAAATATAC CTTATC CAAAATGGT GTTC CAAATG 
AATTGAAAAAATTTGACTCTAGTTTTGGTTTAACTGAAAGTC^ 
GGATACT ATATTTAT AAAGATG CAATTAAC CTTAAATTTAAATTAAC CAG 
TGGTG CAAGTCTTAAAGTTGTTTATAAAGGGCAAGAAGAT CCAT ATAGT C 
AT CAGAAAGAAGATATGACTAAAAAAGGTGAACAG CT CAGTCATT CAACT 
CAAGC CAATGAAAATACAG CAAAAGTAAC CTTTG CTAATATTGACTGGTC 
ACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGTTAAAGGTAGTG 
AGTTACCTTTAACTAAAGGATGGACAACATTTGTATTACATA^ 
AATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTGTAAGTAA 
GAAAGTT CAACAACTTCCTTTAAGT CCTAGATTAT CTAAAAATAAGCAT A 
TGAGGGATATG CTACTTACTATGCAAAAAGATT CAGCGTATT ACG AAACA 
AGTGACAGTCTAGT CCTTCGAATTAAT CT CACTG CAGATACTAAACTTAA 
TTTTAATGCTT GTTAAAGGAG CGAGTG CTCTTACTG AAAATATGATGATG A 
C^CAGTTTGCAGTTGCTGGACCACAAGATGATCCTGTTAGTGAACATAAA 
TACCCAT CAGTATTTCT CTTAACT C CTG C CTTATT GGAAACTG CTAGTGA 
GG CAA CTCTAAATGGTAAGG AAAT CACAG CAT CTGGTATTATCGGTCACA 
TCAAGGATGGTGAT AAAAG CAAGCATGTTGAAGT CAAAATGGTG AAT GAA 
AATGGAG ACATG CTAGGAACCC CTGTTATT ATTCAAGGT AAAGACTTGAC 
TAAT CGAACAAAACCATTAATGAGTGGACGTAGAGTA CTTT ATG CCGGTA 
AACAATATGAGTTCCGGGCTAAATTACCACTTAGTCGTTTTAACACTTGG
ATTAGGGTTGAAGTGGTAACAGAAGCAGGAGAGAAAGCAAGTATTGTTCG 
TCGCATGTTCTTTGACCAATCAGTTCCAGAGCTTAACACAGCAGTTGCTA 
AACGTGATTTGACTTCTGATACTGCTCTTATCCACATCGTTGCCAAAGAT 
GACT CT CTAAAACT AAAATTATAT CAAGATGATT CATTAC t T GAAT CTGT 
TGATAAAACCGGTCTTTATAGTTTTAGAAATGGTGTAGAAATCACTAAAG 
AT ATGACAGTAC CACTAGAATTTGGAG ATAATATTATTAAGTTAT CTG CT 
GTTGACTTATCAAATTATCGTCGTAATGAGACCCTTCATATCTATAGAAA 
CCGTTTTGATGTTAAAGCAAGCCAAATGACAGCTGACAAAGGAGCTAAAG 
TAACTGTGGaTATGTTGATGAAGCACTTAGTTGTTCCAGAAATGGCAGGA 
G CTT ATACATTAA.CAATCGACG AAG CT CCAAACACAAATG AATCAGGAAT 
GTTAACAAACG CTAAAGTAT CG ATT CATTATGT AAATGGTGGTGT TGATA 
AAGTTGATGTTCCGATTAAAGTAGTTGACTTAGAAGCTATTCGTAAAGCT 
GAAG AAG CACGT AAAGCTGAAGAAG CACGT AAAG CTGAAG AGGGACATAA. 
AAC C CAAGAAG CACCTAT AGTTGAAGAAGG CTAGAAGGTT AATAACGTT C 
ATCAAACTGATACTACAGTTAAAGCGTCTGATTTACCAAAGACTAAGACA 
GTTTCCGCAGTTCATATGGCTAGAACAGACAATAAACAGATAACTTCACA 
TCAGACACATGTTGAA 

SEQ ID NO. 5105 
STRAIN M732 

TTGAATAATAAAGGTGTCGGTGGCGATGGTGTCC 

AAATTTATCAATACTATAT CAAAATGGACAACAATAAAC CTTACTTAAGT 
CCCAAAGATAAGACTACTGTAGAGAAGTTAGAAGATCGCTGGAAAAAAAT 
TACTTTCAAAGTT C^GGATAOTGG CATTGGTTTGAAAGACGTTTATCTTC 
AATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAGACCTTATCACA 
C CTCCAGGATTTAAAAAAG AAGATAAAAAAGTTG AAAAAC CAAAATTAGA 
CCGT CCa C CAGGAATTGATTTACCAG CAC CAACTT CAATGAGAAGTTTTG 
ATTATTCAACCCCAC CGGGAACTAAGC CAAGCAAACCCAAAGATAGTTTA 
TCAACrrCCTCCAGGTTTCCCAGATTTAAACACGCCGCCGGATGAAGCCAC 
CAAAGGATAGTAAAAAAGACGCTATTG AAGATAAATCAGGAG CAATTAAA 
TATGCTAAGTCT CTT CAACTTAG CTTTGTTGATGACC CTATTTTAG CTAG 
CAAAGTAAATGGCAAAATATTACAAGT CG AA.T CTGATGG CAAATTAGT CA 
TTCCTAGAAATGCTTTGTCAGCTAATCAATTTGATGACACT 
ATTTATCGTAATAATA^TCGCAATAAA-GAAATTACTATCACAACAGATTA 
TTTTG CAGAT ACAAAATATGTCAATAT CACAG CGGTTGACTATTTGAG CA 
ATACT ACTTTTG AG CAATT AGCTA CTGGTG AAACAGTAGATTAC CATG C C 
ATT GTATTTT CAAG CTTTG CTG CTATT AAAGACAAGG GTG GTAAGATTT A 
TGTTAACGAT AAATTGCAAG AAACTTCT CGTATAG CG CTTAAAGATAAAT 
CTGTTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGATAGT 
TTAT CTGTT CGT CGTTT GAATGAGGTT AAAACTGTTGATAATAT CTTGAA 
AAATGATGAACAAGACATTAATCTCAGGAAAACTTACCAATC 
AC CCG ACAAATCGTCGT CTAGAGTTTACT ATT AATAAGA.TTAACT CAAGT 
TCAGAAATCATGACCACITTCAAAGATGGAAAGATGCCAGAATTGGT^ 
ACAAAAAGATGTITCTTTGGATATAAACGATATGG^ 
AAACT ATTCGACTTGGACGAAAGG ATT CTGAATTT AAGGGA 
GCAA^VAACTGGAACAGTTGAATTAGAT ATGTTTTT CAAACAAT CT CAAGA 
CCCAG CTT CAATTATTAAAAAAATATACCTTATC CAAAATGG t GT T CCAA 
ATGAATTGAAAAA^TTTGACT CTAGTTTTGGTTTAACT CAGATA 
GATGGATACT ATATTTATAAAGATG CAATTAACCTTAAaTTTAAATTAAC 
CAGTGGTGCAAGTCTTAAAGTTGTTTATAAA.GGGCAAGAAGATCCATATA 
GTCATCAGAAAGAAGATATGACTAAAAAAGGTGAACAGCTCAGTCATTCA 
ACTCAAG CCAATGAAAATACAGCAAAAGTAACCTTTG CTAATATTGACTG 
GTCACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGGTAAAGGTA 
GTGAGTTACCTTTAACTAAAGGATGGACAACATTTGTATTACATAAAACA 
GAAAATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTGTAAG 
TAAG A=AGTT CAACAACTT C CTTTAAGTCCTAGATTATCT AAAAATAAG C 
ATATGAGGGATATGCTACTTACTATGCAAAAAGATTCAGCGTATTACGAA 
ACAAGTG ACAGT CTAGT CCTTCGAATTAAT CTCACTG CAGATACTAAACT 
TAATTTT AATG CTGTTAAAGGAGCG AGTG CT CTTACTGAAAATATGATGA 
TGAGACAGTTTGCAGTTG CTGGAC CACi^AGATGAT CCTGTT aGTG AACAT 
AAATACCCATCA-GTaTTTCTCTTAACTCCTGCCTTATTGGAAaCTGCTAG 
TGAGG CAACT CTAAATGGTAAGGAAAT CACAGCAT CTGGTATTAT CGGTC 
ACAT CAAGG ATGGTG ATAAAAGCAAGCATGTTGAAGT CAAAATGGTGAAT 
GAAAATGGAGAa^TGCTAGGAACCCCTGTTATTATTCAAGGTAAAGACTT 
GACTAATCGAACAAAAC CATTAATG AGTGGACGTAGAGTACTTTATG C CG 
GTAAACAATATGAGTTCCGGGCTAA^TTACCACTTAGtCGTTTTAACACT 
TGGATTAGGGTTGAAGTGGTAACAGAAGCAGGAGAGAAAGCAAGTATTGT 
T CGT CGCATGTT CTTTGAC CAAT CAGTTCCAGAG CTT AACACAG CAGTTG 
CTAAACGTGATTTGACTT OTGATACTG CTCTT ATCCACAT CGTTG CCAAA 
GATGACTCTCTAAAACT AAAATTAT AT CAAGATGATT CATTACTTGAAT C 
TGTTGATAAAACCGGTCITTATAGTTTTAG 

AAGATATGACAGTACCACTAGAATTTGGAGATAATATTATTAAGTTATCT 
GCTGTTGACTTAT CAAATTATCGTCGT AATGAGACCCTT CATAT CTATAG 
AAACCGTTTTGATGTT AAAG CAAG C CAAATGACAG CTGACAAAGGAG CT A 
AAGTAACTGTGGATATGTTGATGAAGCACTTAGTTGTTCCAGAAATGGCA 
GGAG CTT ATACATTAACAATCGACGAAG CTCCAAACACAAAT GAATCAGG 
AATGTTAACAAACGCTAAAGTATCGATTCATTATGTAAATGGTGGTGTTG 
ATAAAGTTGATGTT C CGATTAAAGT AGTTG ACTTAGAAG CTATTCGTAAA 
G CTGAAG AAG CACATAAAG CTGACGAAG CACGTAAAGCTG AAGAAGCACG 
TAAAG CTGAAGAAG CACATAAAGCTGAAGTlTVGTACGTAA^GCTGAAGAAG 
CACATAAAGT CX3AAGAAGCACGTAAAG CT . 
GAAG CACCTATAGTTGAAGAAGGCTACAAAGTTAATAACGTTCATCAAAC 
TG AT ACT ACAGTTAA^G CGT CTGATTTAC CAAAG ACT AAGACAGTTT C C G
CAGTT CATATGG CTAGAACAG ACAATAAACAG ATAACTT CACAT CAG ACA 
CATGTTGAAAA 

SEQ ID NO. 5106 
STRAIN COH1 

TTGAATAATAAAGGTGTCGGTGGCGATGGT 

GT C CAAATTTAT CAAT ACT AT AT CAAAATG GACAACAAT AAAC CTT ACT T 
AAGTCCCAAAGATAAGACTACTGTAGAGAAGTTAGAAGATCGCTGGAAAA 
AAATTACTTTCAAAGCTCAGGATACTGGCATTGGTTTGAAAGACGTTTAT 
CTTCAATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAGACCTTAT 
CA.CACCTCCAGGATTTAAAAAAGAAGATAAAAAAGTTGAAAAACCAAAAT 
TAGACCGTCCACCAGGAATTGACTTACCAGCACCAACTTCAATGAGAAGT 
TTTGATTATTCAACCCCACCGGGAACTAAGCCA^ 

TTTATCAACT CCT C CAGG t TT CCCAGATTTAAACACG CCG C CGGATG AAG 
CCaCCAAAGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGGAGCAAT 
TAAATATGCTAAGTCTCTTCAACTTAGCTTTGTTGATGACCCTATTTTAG 
CTAG CAAAGTAAATGGGAAAATATTACAAGT CGAAT CTGATGG CAAATT A 
GTCATTCCTAGAAATGCTTTGTC^GCTAATCAATTTGATGACACTAGTCT 
TAAAATTTAT CGTAATAATAATCG GAATAAAG AAACTACT AT CACAACAG 
ATTATTTTG CAGATACAAA^TATGT CAATAT CACAG CGGTTGACTATTTG 
AGCAATACTACl"l"lTGAGCAATTAGCTACrGGTGAAACAGTAGATTACCA 
TG CCACTGTATTTTCAAGCTTTG CTG CTATTAAAGACAAGGGTGGTAAGA 
TTTATGTTAACGATAAATTGCAAGAAACTTCT CGTATAG CG CTTAAAGAT 
AAATCTGTTAAGATTGGTATTGAATTACCAAATGATGTCAGACATATTGA 
TAGTTTATCTGTTCGTCGTTTGAATGAGGTTAAAACTGTTGATAATATCT 
TGAAAAATGATGAACAAGACATTAATCTC^ 

TACAACC CGACAAAT CGTCGTCTAGAGTTTACTATTAATAACATTAACT C 

AAGTTCAGAAAT CATGACCACTTT CAAAGATGGAAAGATG C CAGAATTGG 

TTGAACAAAAAGATGTTTC"iTT'GGATATAAACGATATGGA(^TGAGTAAG 

TTTAAAACTATTCGACTTGGACGAAAGGATTCroAATT^ 

TATTGC^AAAACTGGAACAGTTGAATTAGATATGTTTTT 

AAGAC CCAG CTT CAATT ATT AAAAAAATATAC CTTATCCAAAATGGTGTT 

CCAAATGAATTGAAAAA^TTTGACTCTAGTTTTGGTTTAACT 

GATAGATGGATACTATATTTATAAAGATGCAATTAACCTTAAATTTAAAT 

TAACCAGTGGTG CAAGT CTTAAAGTTGTTTATAAAGGG CAAG AAGAT CCA 

TATAGT CATCAGAAAGAAGATATGACTAAAAAAGGTGAACAG CT CAGT CA 

TTCAACT CAAG C CAATG AAAAT ACAGCAAAAGTAACCTTTGCTAATATTG 

ACTGGTCACATTATAGTAAGGTTACTGTGAATGGAAAAGAAGTTGGTAAA 

GGTAGTGAGTTACCTCTAACTAAAGGATGGACAACATTTGTATTACATAA 

AACAGAAAATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTG 

TAAGTAAGAAAGTTCAACAACTT CCTTTAAGT CCTAgATTAT CTAAAAAT 

AAG CATATGAGGGATATGCTACTTACTATG CAAAAAG ATT CAGCGTATTA 

CG AAACAAGTGACAGT CTAGTC CTT CG AATTAATCTCACTG CAGATACTA 

AACTTAATTTTAATG CTGTTAAAGGAGCG AGTGCT CTTACTGAAAATATG 

ATGATGAGACAGTTTGCAGTTGCTGGACCACAAGATGATCCTGTTAGTGA 

ACATAAATACCCATCAGTATTT CTCTTAACT CCTG CCTTATTGGAAACTG 

CT AGTGAGG CAACT CTAAATGGTAAGGAAATCACAGCATCTGGTATTAT C 

GGTCACATCAAGGATGGTGATAAAAGCAAGCATGTTGAAGTCAAAATGGT 

GAATG AAAATGG AGACATG CTAGGAAC CCCTGTTATTATT CAAGGTAAAG 

ACITGACTAATCGAAaAAAACCACTAATGAGTGGACGTAGAGTACrTTAT 

GCCGGTAAACAATATGAGTTCCGGGCTAAATTACC^CrTAGTCGTTTTAA 

CACTTGGATT AGGGTTGAAGTGGT AACAGAAG CAG GAGAGAAAG CAAGT A 

TTGTTCGTCGCATGTTCTTTGACCAATCAGTTC 

GTTGCTAAACGTGACTtGACTTCTGATACTGCTCTT^ 

CAAAGATGACTCTCTAAAa CTAAAATTATAT CAAGATGATTCATTACTTG 

AATCTGTTGATAAAACCGGTCTCTATAGTTTTAGAAATGGTGTAGAAATC 

ACTAAAGATATGAC^GTACCACTAGAATTTGGAGATAATATTATTAAGTT 

AT CTG CTGTTG ACTTATCAAATTAT CGT CGTAATGAGACC CTT CATATCT 

ATAGAAACCGTTTTGATGTTAAAGCAAGCCAAATGACAGCTGACAAAGGA 

G CTAAAGTAACT GTGGATATGTTGATGAAG CACTTAGTTGTT CCAGAAAT 

GG CAGGAGCTTATACATTAACAAT CGACGAAG CTC CAAACACAAATGAAT 

CAGGAATGTTAACAAACG CTAAAGTAT CGATT CATTATGTAAATGGTGGT 

GTTGATAAAGTTGATGTTCCGATTAAAGTAGTTGACTTAGAAGCTATTCG 

TAAAGCTaAAGAAGCACATAAAGCTGACGAAGCACGTAAAGCTGAAGAAG 

CACX3TAAAGCTGAAGAAGCACATAAAGCTGAAGAAGTACGTAAAGCTGAA 

GAAG CACAT AAAGTCGAAG AAG CACGTAA^GCTGAAG AGGGACATAAAAC 

C CAAGAAGCAC CTATAGTTGAAGA^GG CTACAAAGTT AAT AACGTTCATC 

AAACTGATACTACAGCTAAAGCGTCTGATTTACCAAAGACTAAGACAGTT 

T C CG CAGTT CATATGG CTAGAACAGACAATAAACAGATAACTTCACATCA 

GACACATGT 

SEQ ID NO. 5107 
STRAIN M781 

TTGAATAATAAAGGTGTCGGTGGCGATGGT 

GT CCAAATTTAT CAATACTATAT CAAAATGGACAACAATAAAC CTTACTT 
AAGTC CCAAAGATAAGACTACTGTAGAGAAGTTAG AAGAT CG CTGGAAAA 
AAATT ACTTT CAAAGTT CAGGATACTGGCATTGGTTTGAAAGACGTTTAT 
CITCAATCTGCTAAGTATGTTGGTGGTGGCAATA^TAATTTAGACCTTAT 
CACAC CT CCAGGATTTAAAAAAGAAGATAAAAAAGTTGAAAAACCAAAAT 
TAGACCGTCCACCAGGAATTGATTTACCAGCACCAACTTCAA 
TTTGATT ATT CAAC CCCAC CGGGAACTAAG CCAAG CAAAC CCAAAGATAG 
TTTATCAACT CCT C CAGGTTT C CCAGATTT AAACACG CCG C CGGATGAAG 
CCaCCAAAGGATAGTAAAAAAGACGCTATTGAAGATAAATCAGGAGCAAT
TAAATATGCTAAGTCTCTT CAACT T AG CTTTGTTG ATGAC CCTATTTTAG 
CTAGCAAAGTAAATGGCAAAATATTACAAGTCGAATCTGATGGCAAATTA 
GT CATTC CT AGAAAT GCTTT GT CAG CT AAT CAAT TTGATG ACACT AGT CT 
TAAaATTTATCGTAATAATAATCGCAATAAAGAAATTaCTATCACAACAG 
ATTATTTTGCAGATACAAAATATGTCAATATCACAGCGGTTGACTATTTG 
AGCAATACTACTTTTGAGCAATTAGCTACTGGTGAAACAGTAGATTACCA 
TGCCATTGTATTTT CAAGCTTTG CTG CTAT TAAAGACAAGGGTGGTAAGA 
TTTATGTTAACGATAAATTGCAAGAAACrrTCrrCGTATAGCGCTTAAAGAT 
AAAT CTGTT AAGATTGGTATTG AATTAC CAAATG ATGTCAGACATATTG A 
TAGTTTATCTGTTCGTCGTTTGAATGAGGTTAAAACTGTTGATAATATCT 
TGAAAAATGATG AACAAGACATTAATC TCAG CAAAACTTACCAATTAAAA 
TACAACC CG ACAAATCGT CGTCTAG AGTTTACTATTAAT AACATTAA.CT C 
AAGTT CAGAAAT CATGAC CACTTT CAAAG ATGGAAAGATG C CAGAATTG G 
TTGAACAAAAAGATGTTTCTTTGGATATAAACGATATGGACATGAGTAAG 
TTTAAAACTATTCGACTTGGACGAAAGGATTCTGAATTTAAGGGACAACT 
TATTGCAAAAACTGGAACAGTTGAATTAGATATGTTTTT CAAACAAT CT C 
AAGACCCAGCTTCAATTATTAAAAAAATATACCTTATCCAAAATGGTGTT 
CCAAATGAATTGAAAAAATTTGACrCrAGTTTTGGTTTAACTGAAAGTCA 
GATAGATGGATACTATATTTATAAAGATG CAATTAACCTTAAATTTAAAT 
TAAC CAGTGGTG CAAGTCTT AAAGTTGTTTAT AAAGGGCAAG AAGAT C CA 
TATAGT CAT CAGAAAGAAGATATG ACTAAAAAAGGTGAACAG CT CAGT CA 
TTCAACTCAAGCCAATGAAAATACAGCAAAAGTAACCTTTGCTA 
ACTGGTCACATTATAGTAAGGTTACTGTGAATG^IAAAAGAAGTTGGTAAA 
GGTAGTGAGTTACCTTTAACTAAAGGATGGACAACATTTGTATTACATAA 
AACAGAAAATTCATTAAATGTTAAAAGTTTGATTATGGAGACGGGTAGTG 
TAAGTAAGAAAGTTCAACAACTTCCTTTAAGT C CTAGATT AT CTAAAAAT 
AAGCATATGAGGGATATGCTACITACTATG CAAAAAGATT CAGCGTATT A 
CX3AAACAAGTGACAGTCTAGTC CTT CG AATTAAT CTCACTGCAGATACT A 
AACTTAATTTTAATGCTGTTAAAGGAGCGAGTGCTCTTACTGAAAATATG 
ATGATGAa^CAGTTTGCAGTTGCTGGACCACAAGATGATCCTGTTAGTGA 
ACATAAATACCCAT CAGTATTT CT CTTAACTCCTG CCTT ATTGGAAACTG 
CTAGTGAGG CAACTCTAAATGGTAAGGAAATCACAGCAT CTGGTATTAT C 
GGTCACATCAAGGATGGTG ATAAAAGCAAG CATGTTG AAGTCAAAATGGT 
GAATGAAAATGGAGACATG CTAGGAACCCCTGTTATT ATT CAAGGTAAAG 
ACTTGACTAATCGAAC^^^CCATTAATGAGTGGACGTAGAGTACTTTAT 
GCCGGTAAACAATATGAGTT CCGGG CTAAATTAC CACTTAGT CGTTTTAA 
CACTT GGATTAGGGTTGAAGTGGTAACAGAAG CAGGAGAGAAAG CAAGTA 
TTGTTCGTCGCATGTTCTTTGACCAATCAGTTCCAG^ 
GTTGCTAAACGTGATTTGACITCTGATACTGCTCTT 

CAAAGATGACT CT CT AAAACTAAAATTAT ATCAAGATGATTCATTACTTG 
AATCTGTTGATAAAAC CGGT CTTT ATAGTTTTAGAAATGGTGTAGAAAT C 
ACTAAAGATATGACAGT AC CACTAGAATT TGGAGATAATATTATTAAGTT 
AT CTG CTGTTGACTTATCAAATTATCGT CGTAATGAGAC C CTT CATAT CT 
ATAGAAACCGTTTTGATGTTAAAGCAAGCCAAATGACAGCTGACAAAGGA 
GCTAAAGTAACTGTGGATATGTTG ATGAAG CACTTAGTTGTT CCAGAAAT 
GG CAGGAGCTTATACATTAACAAT CGACX5AAGCTCCAAACACAAATGAAT 
CAGGAATGTTAACAAACGCTAAAGTAT CGATTCATTATGTAAATGGTGGT 
GTTGATAAAGTTGATGTTC CGATTAAAGTAGTTGACITA 
TAAAG CTGAAGAAG CACATAAAG CTGACGAAG CACGTAAAGCTG AAGAAG 
CACGTAAAGCTGAAGAAGCACATAAAGCTGAAGAAGTACGTAAAGCTGAA 
GAAG CACATAAAGT CGAAGAAG CA C CGTAAAG CTG AAGAGGG ACATAAAA 
CCCAAGAAG CACCTATAGTTGAAGAAGGCTACAAAGTTAATAACGTT CAT 
CAAACTGAT ACTACAGTTAAAG CGTCTGATTTACCAAAGACTAAGACAGT 
TTCCGCAGTTCATATGGCTAGAACAGACAATAAACAGATAACTTCA 
AGACACATGTTG 

SEQ ID NO. 5109 
STRAIN JM9130013 

TGGTGTCCAAATTT ATCAATACTATAT CAAAATGG ACAACAATAAAC 

CITACTTAAGTCCCAAAGATAAGACTACTGTAGAGAAGTT AGAAGAT CG C 

TGGAAAAAAATTACTTTCAAAGTTCAGGATACTGGCAT^ 

CGTTTATCTTCAATCTGTTAAGTATGTTGGTGGTGGCAATAATAATTTAG 

ACCTTATCACACCn'CCAGGATTTAAAAAAGAAGATAAAAAAGTTGAAA 

CCAAAATTAGACCGT C CAC CAGGAATTGATTTAC CAG CACCAACTT CAAT 

GAGAAGTTTTGATTATT CAAC CCCAC CGGGAACTAAG CCAAG CAAACCCA 

AAGATAGTTTATCAACT CCT CCAGGTTTCC CAGATTTAAACA 

GATGAAGCACCAAAGGATAGTAAAAAAa^CG CTATTGAAGATAAATCAGG 

AG CAATTAAATATG CTAAGT CTCTT CAACTTAGCTT^ 

TTTTAGCTAGCAAAGTAAATGGCAAAATATTACAAGTCX3AATCTGATGGC 

AAATTAGTCATTCCTAG AAATG CTTTGTCAGCTAATCAATTTGATGACAC 

TAGTCTTAAAATTTATCGT AATAAT AATCG CAATAAAGAAATTACTATCA 

CAACAGATTATTTTGCAGATACAAAATATGTCAATATCACAGCGGTTG^ 

TATTTGAGCAaTACTACITTTGAGCAATTAGCTACTC 

TTAC CATGC CATTGTATTTT CAAG CTTTG CTG CTATTAAAGACAAGGGTG 

GTAAGATTTATGTTAACGATAAATTGC^AGAAACTTCTCGTATAG CG CTT 

AAAGATAAATCTGTTAAGATTGGTATTGAATTAC CAAATGATGT CAGACA 

TATTGATAGTTTATCTGTTCGTCGTTTGAATGAGGTTAA 

ATAT CTTGAAAAATGATGAACAAG ACATTAATCT CAG CAAAACTTAC CAA 

TTAAAATACAACCCGACAAATCGT CGT CTAGAGTTTACTATTAATA^ 

TAACT CAAGTTCAGAAATCATGACCACTTT CAAAG ATGGAAAGATG CCAG 

AATTGGTTGAACAAAAAGATGU'iTCTTTGGATATAAACGATATGGACATG 

AGTAAGTTTAAAACTATTCGACITGGACGAAAGGATTCT 
ACAACTTATTG CAAAAACT G GAACAGTTGAATTAG ATAT GTTTTT CAAAC
AAT CT G^GAC CCAG CI^CAATTATTAAAAAAAT ATACCITAT CCAA=AT 
GGTGTT C CAAATG AATTGAAAAAATTTG ACTCTAGTTTTGGTTTAACTG A 
AAGTCAGATAGATGGATACTATATTTATAAAGATGCAATTAACCTTAAAT 
TTAAATTAACCAGTGGTGCAaGTCTTAAAGTTGTTTATAAAGGGCAAGAA 
GATC CAT ATAGT CAT CAG AAAGAAG AT ATG ACT AAAAr AGGTGAACAG CT 
CAGT CATTCAACT CAAG CCAATGAAAATACAG CAAAAGTAACCTTTG CT A 
ATATTGACT GGT CACATTAT AGTAAGGTT ACTGTG AATGG AAAAGAAGTT 
GGTAAAGGTAGTGAGTTACCTTTAACTAAAGGATGGAGAACATTTGTATT 
ACATAAAACAGAAAATTCATTAAATGTTAAAAGTTTGATTATGGAGACGG 
GTAGTGTAAGTAAGAAAGTTCAACAACTTCCTTTAAGTCCTAGATTATCT 
AAAAATAAG CATATG AGGG ATATG CTACTTACTATG CAAAAAGATTCAG C 
GTATTACGAAACLAAGTGACAGTCTAGTCCTTCGAATTAATCrCACTGCAG 
ATACTAAACTTA^TTTTAATGCTGTTAAAGGAGCGAGTGCTCTTACTGAA 
AATATGATG ATGAGACAGTTTG CAGTTGCTGG AC CACAAGATGAT C CTGT 
TAGTGAACATAAATAC C CAT CAGTATTTCT CTTAACT CCTG C CTTATTGG 
AAACTGCTAGTGAGG CAACTCTAAATGGTAAGGAAAT CACAG CAT CTGGT 
ATTAT CGGTCACATCAAGGATGGT GATAAAAG CAAG CATGTTGAAGT CAA 
AATGGTG AATGAAAATGGAG ACATG CTAGGAAC C C CTGTTATTATTCAAG 
GTAAAGACTTGACTAAT CG AACAAAAC CATTAATGAGTGGACGTAGAGTA 
CITTATGCCGGTAAACAATATGAGTTC CGGG CTAAATTACCACTTAGTCG 
TTTTAACACTTGGATTAGGGTT GAAGTGGT AACAG AAGCAGGAg aGa a ag 
cAaGTATTGTTCGTCG CATGTT CTTTGAC CAATCAGTT C CAG AG CTTAAC 
ACAG CAGTTG CTAAACGTGATTTG ACTT CTGATACTG CT CTTAT C CACAT 
CGTTGC CAAAGATGACT CT CTAAAACTAAAATTATATCAAGATGATTCAT 
TACTTGAAT CTGTTGATAAAAC CGGT CTTT ATAGTTTTAGAAATGGTGT A 
GAAAT CACT AAAGAT ATGACAGTAC CACTAGAATTTGGAGAT AAT ATTAT 
TAAGTTATCTGCTGTTGACTTATCAAATTATCGTCGTAATGAGACCCTTC 
ATAT CTATAGAAACCGTTTTGATGTTAAAG CAAG C CAAATGACAGCTGAC 
AAAGGAGCTAAAGTAACTGT GG ATATGTTGATGAAGCACTTAGTTGTTC C 
AGAAATGGCAGGAGCTTATACATTAACAATCGACGAAGCTCCAAAaA.CAA 
ATGAATCAGGAATGTTAACAAACG CTAAAGTAT CGATTCATT ATGTAAAT 
GGTGGTGTTGATAAAGTTG ATGTT C CGATT AAAGTAGTTGACTTAGAAG C 
TATTCGTAAAG CTGAAGAAG CACATAAAG CTGACGAAGCACGTAAAG CTG 
AAGAAGCACGTAAAGCTGAAGAAGCACATAAAGCTGAAGAAGTACGTAAA 
GCTGAAGAAGCACATAAAGTCGAAGAAGCACCGTAAAGCTGAAGAGGGAC 
ATAAAACCCAAGAAGCACCTATAGTTGAAGAAGGCTACAAGGTTAATAAC 
GTTCATCAAACTGATACTACAGTTAAAG CGTCTGATTTAC CAAAGACTAA 
GACAGTTTC CG CAGTTCATATGGCTAGAACAGACAATAAACAGATAACTT 
CACAT CAGACACATGTTG 



MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa235280.2{*} December 10, 2002 05:12

                            1                                                   50
msa235280.2{195_COH1}       ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_M732}       ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_M781}       ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_H36B}                                   TGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_JM9130013}                              TGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_18RS21}     ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_2603}       ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA
msa235280.2{195_A909}       ttgaataata aaggtgtcgg tggcgaTGGT GTCCAAATTT ATCAATACTA
Consensus                                               **** ********** **********



                            51                                                 100
msa235280.2{195_COH1}       TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_M732}       TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_M781}       TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_H36B}       TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_JM9130013}  TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_18RS21}     TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_2603}       TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
msa235280.2{195_A909}       TATCAAAATG GACAACAATA AACCTTACTT AAGTCCCAAA GATAAGACTA
Consensus                   ********** ********** ********** ********** **********



                            101                                                150
msa235280.2{195_COH1}       CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_M732}       CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_M781}       CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_H36B}       CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_JM9130013}  CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_18RS21}     CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_2603}       CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
msa235280.2{195_A909}       CTGTAGAGAA GTTAGAAGAT CGCTGGAAAA AAATTACTTT CAAAGTTCAG
Consensus                   ********** ********** ********** ********** **********



                            151                                                200
msa235280.2{195_COH1}       GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_M732}       GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_M781}       GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_H36B}       GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_JM9130013}  GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_18RS21}     GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_2603}       GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
msa235280.2{195_A909}       GATACTGGCA TTGGTTTGAA AGACGTTTAT CTTCAATCTG TTAAGTATGT
Consensus                   ********** ********** ********** ********** **********

                            201                                                250
msa235280.2{195_COH1}       TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_M732}       TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_M781}       TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_H36B}       TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_JM9130013}  TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_18RS21}     TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_2603}       TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
msa235280.2{195_A909}       TGGTGGTGGC AATAATAATT TAGACCTTAT CACACCTCCA GGATTTAAAA
Consensus                   ********** ********** ********** ********** **********



                            251                                                300
msa235280.2{195_COH1}       AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_M732}       AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_M781}       AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_H36B}       AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_JM9130013}  AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_18RS21}     AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_2603}       AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
msa235280.2{195_A909}       AAGAAGATAA AAAAGTTGAA AAACCAAAAT TAGACCGTCC ACCAGGAATT
Consensus                   ********** ********** ********** ********** **********



                            301                                                350
msa235280.2{195_COH1}       GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_M732}       GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_M781}       GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_H36B}       GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_JM9130013}  GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_18RS21}     GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_2603}       GATTTACCAg CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
msa235280.2{195_A909}       GATTTACCAc CACCAACTTC AATGAGAAGT TTTGATTATT CAACCCCACC
Consensus                   *********_ ********** ********** ********** **********



                            351                                                400
msa235280.2{195_COH1}       GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_M732}       GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_M781}       GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_H36B}       GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_JM9130013}  GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_18RS21}     GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_2603}       GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
msa235280.2{195_A909}       GGGAACTAAG CCAAGCAAAC CCAAAGATAG TTTATCAACT CCTCCAGGTT
Consensus                   ********** ********** ********** ********** **********



                            401                                                450
msa235280.2{195_COH1}       TCCCAGATTT AAACACGCCG CCGGATGAAG cCACcAAAGG ATAGTAAAAA
msa235280.2{195_M732}       TCCCAGATTT AAACACGCCG CCGGATGAAG cCACcAAAGG ATAGTAAAAA
msa235280.2{195_M781}       TCCCAGATTT AAACACGCCG CCGGATGAAG cCACcAAAGG ATAGTAAAAA
msa235280.2{195_H36B}       TCCCAGATTT AAACACGCCG CCGGATGAAG -CACtAAAGG ATAGTAAAAA
msa235280.2{195_JM9130013}  TCCCAGATTT AAACACGCCG CCGGATGAAG .CACCAAAGG ATAGTAAAAA
msa235280.2{195_18RS21}     TCCCAGATTT AAACACGCCG CCGGATGAAG .CACcAAAGG ATAGTAAAAA
msa235280.2{195_2603}       TCCCAGATTT AAACACGCCG CCGGATGAAG .CACCAAAGG ATAGTAAAAA
msa235280.2{195_A909}       TCCCAGATTT AAACACGCCG CCGGATGAAG .CACtAAAGG ATAGTAAAAA
Consensus                   ********** ********** ********** _***_***** **********



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



451 

AGACGCTATT 

AGACGCTATT 

AGACGCTATT 

AGACGCTATT 

AGACGCTATT 

AGACGCTATT 

AGACGCTATT 

AGACGCTATT 
********** 



GAAGATAAAT 

GAAGATAAAT 

GAAGATAAAT 

GAAGATAAAT 

GAAGATAAAT 

GAAGATAAAT 

GAAGATAAAT 

GAAGATAAAT 
********** 



CAGGAGCAAT 
CAGGAGCAAT 
CAGGAGCAAT 
CAGGAGCAAT 
CAGGAGCAAT 
CAGGAGCAAT 
CAGGAGCAAT 
CAGGAGCAAT 



TAAATATGCT 
TAAATATGCT 
TAAATATGCT 
TAAATATGCT 
TAAATATGCT 
TAAATATGCT 
TAAATATGCT 
TAAATATGCT 



500 

AAGTCTCTTC 
AAGTCTCTTC 
AAGTCTCTTC 
AAGTCTCTTC 
AAGTCTCTTC 
AAGTCTCTTC 
AAGTCTCTTC 
AAGTCTCTTC 



********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



501 

AACTTAGCTT 
AACTTAGCTT 
AACTTAGCTT 
AACTTAGCTT 
AACTTAGCTT 
AACTTAGCTT 
AACTTAGCTT 
AACTTAGCTT 



TGTTGATGaC 
TGTTGATGaC 
TGTTGATGaC 
TGTTGATGaC 
TGTTGATGaC 
TGTTGATGaC 
TGTTGATGgC 
TGTTGATGaC 



CCTATTTTAG 
CCTATTTTAG 
CCTATTTTAG 
CCTATTTTAG 
CCTATTTTAG 
CCTATTTTAG 
CCTATTTTAG 
CCTATTTTAG 



CTAGCAAAGT 
CTAGCAAAGT 
CTAGCAAAGT 
CTAGCAAAGT 
CTAGCAAAGT 
CTAGCAAAGT 
CTAGCAAAGT 
CTAGCAAAGT 



********** ********_* ********** ********** 



550 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 
********** 



846 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



551 

ATATTACAAG 
ATATTACAAG 
ATATTACAAG 
ATATTACAAG 
ATATTACAAG 
ATATTACAAG 
ATATTACAAG 
ATATTACAAG 



TCGAATCTGA 
TCGAATCTGA 
TCGAATCTGA 
TCGAATCTGA 
TCGAATCTGA 
TCGAATCTGA 
TCGAATCTGA 
TCGAATCTGA 



********** ********** 



TGGCAAATTA 

TGGCAAATTA 

TGGCAAATTA 

TGGCAAATTA 

TGGCAAATTA 

TGGCAAATTA 

TGGCAAATTA 

TGGCAAATTA 
********** 



GTCATTCCTA 
GTCATTCCTA 
GTCATTCCTA 
GTCATTCCTA 
GTCATTCCTA 
GTCATTCCTA 
GTCATTCCTA 
GTCATTCCTA 



600 

GAAATGCTTT 
GAAATGCTTT 
GAAATGCTTT 
GAAATGCTTT 
GAAATGCTTT 
GAAATGCTTT 
GAAATGCTTT 
GAAATGCTTT 



********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



601 

GTCAGCTAAT 
GTCAGCTAAT 
GTCAGCTAAT 
GTCAGCTAAT 
GTCAGCTAAT 
GTCAGCTAAT 
GTCAGCTAAT 
GTCAGCTAAT 



CAATTTGATG 
CAATTTGATG 
CAATTTGATG 
CAATTTGATG 
CAATTTGATG 
CAATTTGATG 
CAATTTGATG 
CAATTTGATG 



ACACTAGTCT 
ACACTAGTCT 
ACACTAGTCT 
ACACTAGTCT 
ACACTAGTCT 
ACACTAGTCT 
ACACTAGTCT 
ACACTAGTCT 



TAAAATTTAT 
TAAAATTTAT 
TAAAATTTAT 
TAAAATTTAT 
TAAAATTTAT 
TAAAATTTAT 
TAAAATTTAT 
TAAAATTTAT 



650 

CGTAATAATA 
CGTAATAATA 
CGTAATAATA 
CGTAATAATA 
CGTAATAATA 
CGTAATAATA 
CGTAATAATA 
CGTAATAATA 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



651 

ATCGCAATAA 
ATCGCAATAA 
ATCGCAATAA 
ATCGCAATAA 
ATCGCAATAA 
ATCGCAATAA 
ATCGCAATAA 
ATCGCAATAA 



AGAAATTACT 
AGAAATTACT 
AGAAATTACT 
AGAAATTACT 
AGAAATTACT 
AGAAATTACT 
AGAAATTACT 
AGAAATTACT 



ATCACAACAG 
ATCACAACAG 
ATCACAACAG 
ATCACAACAG 
ATCACAACAG 
ATCACAACAG 
ATCACAACAG 
ATCACAACAG 



********** ********** ********** 



ATTATTTTGC 

ATTATTTTGC 

ATTATTTTGC 

ATTATTTTGC 

ATTATTTTGC 

ATTATTTTGC 

ATTATTTTGC 

ATTATTTTGC 
********** 



700 

AGATACAAAA 

AGATACAAAA 

AGATACAAAA 

AGATACAAAA 

AGATACAAAA 

AGATACAAAA 

AGATACAAAA 

AGATACAAAA 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



701 

TATGTCAATA 

TATGTCAATA 

TATGTCAATA 

TATGTCAATA 

TATGTCAATA 

TATGTCAATA 

TATGTCAATA 

TATGTCAATA 
********** 



TCACAGCGGT 

TCACAGCGGT 

TCACAGCGGT 

TCACAGCGGT 

TCACAGCGGT 

TCACAGCGGT 

TCACAGCGGT 

TCACAGCGGT 
********** 



TGACTATTTG 

TGACTATTTG 

TGACTATTTG 

TGACTATTTG 

TGACTATTTG 

TGACTATTTG 

TGACTATTTG 

TGACTATTTG 
********** 



AGCAATACTA 

AGCAATACTA 

AGCAATACTA 

AGCAATACTA 

AGCAATACTA 

AGCAATACTA 

AGCAATACTA 

AGCAATACTA 
********** 



750 

CTTTTGAGCA 

CTTTTGAGCA 

CTTTTGAGCA 

CTTTTGAGCA 

CTTTTGAGCA 

CTTTTGAGCA 

CTTTTGAGCA 

CTTTTGAGCA 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



751 

ATTAGCTACT 
ATTAGCTACT 
ATTAGCTACT 
ATTAGCTACT 
ATTAGCTACT 
ATTAGCTACT 
ATTAGCTACT 
ATTAGCTACT 



GGTGAAACAG 
GGTGAAACAG 
GGTGAAACAG 
GGTGAAACAG 
GGTGAAACAG 
GGTGAAACAG 
GGTGAAACAG 
GGTGAAACAG 



TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 
TAGATTACCA 



TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 
TGCCATTGTA 



800 

TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 
TTTTCAAGCT 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



801 

TTGCTGCTAT 

TTGCTGCTAT 

TTGCTGCTAT 

TTGCTGCTAT 

TTGCTGCTAT 

TTGCTGCTAT 

TTGCTGCTAT 

TTGCTGCTAT 
********** 



TAAAGACAAG 

TAAAGACAAG 

TAAAGACAAG 

TAAAGACAAG 

TAAAGACAAG 

TAAAGACAAG 

TAAAGACAAG 

TAAAGACAAG 
********** 



GGTGGTAAGA 

GGTGGTAAGA 

GGTGGTAAGA 

GGTGGTAAGA 

GGTGGTAAGA 

GGTGGTAAGA 

GGTGGTAAGA 

GGTGGTAAGA 
********** 



TTTATGTtAA 

TTTATGTtAA 

TTTATGTtAA 

TTTATGTcAA 

TTTATGTtAA 

TTTATGTtAA 

TTTATGTtAA 

TTTATGTtAA 
*******_** 



850 

CGATAAATTG 

CGATAAATTG 

CGATAAATTG 

CGATAAATTG 

CGATAAATTG 

CGATAAATTG 

CGATAAATTG 

CGATAAATTG 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



851 

CAAGAAACTT 
CAAGAAACTT 
CAAGAAACTT 
CAAGAAACTT 
CAAGAAACTT 
CAAGAAACTT 
CAAGAAACTT 
CAAGAAACTT 



CTCGTATAGC 
CTCGTATAGC 
CTCGTATAGC 
CTCGTATAGC 
CTCGTATAGC 
CTCGTATAGC 
CTCGTATAGC 
CTCGTATAGC 



********** ********** 



GCTTAAAGAT 

GCTTAAAGAT 

GCTTAAAGAT 

GCTTAAAGAT 

GCTTAAAGAT 

GCTTAAAGAT 

GCTTAAAGAT 

GCTTAAAGAT 
********** 



AAATCTGTTA 

AAATCTGTTA 

AAATCTGTTA 

AAATCTGTTA 

AAATCTGTTA 

AAATCTGTTA 

AAATCTGTTA 

AAATCTGTTA 
********** 



900 

AGATTGGTAT 

AGATTGGTAT 

AGATTGGTAT 

AGATTGGTAT 

AGATTGGTAT 

AGATTGGTAT 

AGATTGGTAT 

AGATTGGTAT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}



901 

TGAATTACCA 
TGAATTACCA 
TGAATTACCA 
TGAATTACCA 
TGAATTACCA 



AATGATGTCA 
AATGATGTCA 
AATGATGTCA 
AATGATGTCA 
AATGATGTCA 



GACATATTGA 
GACATATTGA 
GACATATTGA 
GACATATTGA 
GACATATTGA 



TAGTTTATCT 
TAGTTTATCT 
TAGTTTATCT 
TAGTTTATCT 
TAGTTTATCT 



950 

GTTCGTCGTT 
GTTCGTCGTT 
GTTCGTCGTT 
GTTCGTCGTT 
GTTCGTCGTT 






msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



TGAATTACCA AATGATGTCA GACATATTGA TAGTTTATCT GTTCGTCGTT 

TGAATTACCA AATGATGTCA GACATATTGA TAGTTTATCT GTTCGTCGTT 

TGAATTACCA AATGATGTCA GACATATTGA TAGTTTATCT GTTCGTCGTT 
********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



951 

TGAATGAGGT 

TGAATGAGGT 

TGAATGAGGT 

TGAATGAGGT 

TGAATGAGGT 

TGAATGAGGT 

TGAATGAGGT 

TGAATGAGGT 
********** 



TAAAACTGTT 

TAAAACTGTT 

TAAAACTGTT 

TAAAACTGTT 

TAAAACTGTT 

TAAAACTGTT 

TAAAACTGTT 

TAAAACTGTT 
********** 



GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
GATAATATCT 
********** 



TGAAAAATGA 

TGAAAAATGA 

TGAAAAATGA 

TGAAAAATGA 

TGAAAAATGA 

TGAAAAATGA 

TGAAAAATGA 

TGAAAAATGA 
********** 



1000 

TGAACAAGAC 

TGAACAAGAC 

TGAACAAGAC 

TGAACAAGAC 

TGAACAAGAC 

TGAACAAGAC 

TGAACAAGAC 

TGAACAAGAC 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1001 

ATTAATCTCA 

ATTAATCTCA 

ATTAATCTCA 

ATTAATCTCA 

ATTAATCTCA 

ATTAATCTCA 

ATTAATCTCA 

ATTAATCTCA 
**********



GCAAAACTTA 

GCAAAACTTA 

GCAAAACTTA 

GCAAAACTTA 

GCAAAACTTA 

GCAAAACTTA 

GCAAAACTTA 

GCAAAACTTA 
********** 



CCAATTAAAA 

CCAATTAAAA 

CCAATTAAAA 

CCAATTAAAA 

CCAATTAAAA 

CCAATTAAAA 

CCAATTAAAA 

CCAATTAAAA 
********** 



TACAACCCGA 

TACAACCCGA 

TACAACCCGA 

TACAACCCGA 

TACAACCCGA 

TACAACCCGA 

TACAACCCGA 

TACAACCCGA 
********** 



1050 

CAAATCGTCG 

CAAATCGTCG 

CAAATCGTCG 

CAAATCGTCG 

CAAATCGTCG 

CAAATCGTCG 

CAAATCGTCG 

CAAATCGTCG 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1051 

TCTAGAGTTT 

TCTAGAGTTT 

TCTAGAGTTT 

TCTAGAGTTT 

TCTAGAGTTT 

TCTAGAGTTT 

TCTAGAGTTT 

TCTAGAGTTT 
********** 



ACTATTAATA 

ACTATTAATA 

ACTATTAATA 

ACTATTAATA 

ACTATTAATA 

ACTATTAATA 

ACTATTAATA 

ACTATTAATA 
********** 



ACATTAACTC 
ACATTAACTC 
ACATTAACTC 
ACATTAACTC 
ACATTAACTC 
ACATTAACTC 
ACATTAACTC 
ACATTAACTC 



AAGTTCAGAA 
AAGTTCAGAA 
AAGTTCAGAA 
AAGTTCAGAA 
AAGTTCAGAA 
AAGTTCAGAA 
AAGTTCAGAA 
AAGTTCAGAA 



1100 
ATCATGACCA 
ATCATGACCA 
ATCATGACCA 
ATCATGACCA 
ATCATGACCA 
ATCATGACCA 
ATCATGACCA 
ATCATGACCA 



********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



1101 

CTTTCAAAGA 
CTTTCAAAGA 
CTTTCAAAGA 
CTTTCAAAGA 
CTTTCAAAGA 
CTTTCAAAGA 
CTTTCAAAGA 
CTTTCAAAGA 



TGGAAAGATG 
TGGAAAGATG 
TGGAAAGATG 
TGGAAAGATG 
TGGAAAGATG 
TGGAAAGATG 
TGGAAAGATG 
TGGAAAGATG 



CCAGAATTGG 
CCAGAATTGG 
CCAGAATTGG 
CCAGAATTGG 
CCAGAATTGG 
CCAGAATTGG 
CCAGAATTGG 
CCAGAATTGG 



********** ********** ********** 



TTGAACAAAA 

TTGAACAAAA 

TTGAACAAAA 

TTGAACAAAA 

TTGAACAAAA 

TTGAACAAAA 

TTGAACAAAA 

TTGAACAAAA 
********** 



1150 

AGATGTTTCT 

AGATGTTTCT 

AGATGTTTCT 

AGATGTTTCT 

AGATGTTTCT 

AGATGTTTCT 

AGATGTTTCT 

AGATGTTTCT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1151 

TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 
TTGGATATAA 



ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 
ACGATATGGA 



CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 
CATGAGTAAG 



TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 
TTTAAAACTA 



1200 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 
TTCGACTTGG 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1201 

ACGAAAGGAT 

ACGAAAGGAT 

ACGAAAGGAT 

ACGAAAGGAT 

ACGAAAGGAT 

ACGAAAGGAT 

ACGAAAGGAT 

ACGAAAGGAT 
********** 



TCTGAATTTA 
TCTGAATTTA 
TCTGAATTTA 
TCTGAATTTA 
TCTGAATTTA 
TCTGAATTTA 
TCTGAATTTA 

TCTGAATTTA 
********** 



AGGGACAACT 

AGGGACAACT 

AGGGACAACT 

AGGGACAACT 

AGGGACAACT 

AGGGACAACT 

AGGGACAACT 

AGGGACAACT 
********** 



TATTGCAAAA 

TATTGCAAAA 

TATTGCAAAA 

TATTGCAAAA 

TATTGCAAAA 

TATTGCAAAA 

TATTGCAAAA 

TATTGCAAAA 
********** 



1250 

ACTGGAACAG 

ACTGGAACAG 

ACTGGAACAG 

ACTGGAACAG 

ACTGGAACAG 

ACTGGAACAG 

ACTGGAACAG 

ACTGGAACAG 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1251 

TTGAATTAGA 

TTGAATTAGA 

TTGAATTAGA 

TTGAATTAGA 

TTGAATTAGA 

TTGAATTAGA 

TTGAATTAGA 

TTGAATTAGA 
**********



TATGTTTTTC 
TATGTTTTTC 
TATGTTTTTC 
TATGTTTTTC 
TATGTTTTTC 
TATGTTTTTC 
TATGTTTTTC 
TATGTTTTTC 



AAACAATCTC 
AAACAATCTC 
AAACAATCTC 
AAACAATCTC 
AAACAATCTC 
AAACAATCTC 
AAACAATCTC 
AAACAATCTC 



AAGACCCAGC 
AAGACCCAGC 
AAGACCCAGC 
AAGACCCAGC 
AAGACCCAGC 
AAGACCCAGC 
AAGACCCAGC 
AAGACCCAGC 



********** ********** ********** 



1300 

TTCAATTATT 

TTCAATTATT 

TTCAATTATT 

TTCAATTATT 

TTCAATTATT 

TTCAATTATT 

TTCAATTATT 

TTCAATTATT 
********** 



1301 



1350 






msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



AAAAAAATAT 

AAAAAAATAT 

AAAAAAATAT 

AAAAAAATAT 

AAAAAAATAT 

AAAAAAATAT 

AAAAAAATAT 

AAAAAAATAT 
********** 



ACCTTATCCA 

ACCTTATCCA 

ACCTTATCCA 

ACCTTATCCA 

ACCTTATCCA 

ACCTTATCCA 

ACCTTATCCA 

ACCTTATCCA 
********** 



AAATGGTGTT 
AAATGGTGTT
AAATGGTGTT 
AAATGGTGTT 
AAATGGTGTT 
AAATGGTGTT 
AAATGGTGTT 
AAATGGTGTT 



CCAAATGAAT 
CCAAATGAAT 
CCAAATGAAT 
CCAAATGAAT 
CCAAATGAAT 
CCAAATGAAT 
CCAAATGAAT 
CCAAATGAAT 



TGAAAAAATT 
TGAAAAAATT 
TGAAAAAATT 
TGAAAAAATT 
TGAAAAAATT 
TGAAAAAATT 
TGAAAAAATT 
TGAAAAAATT 



********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1351 

TGACTCTAGT 

TGACTCTAGT 

TGACTCTAGT 

TGACTCTAGT 

TGACTCTAGT 

TGACTCTAGT 

TGACTCTAGT 

TGACTCTAGT 
********** 



TTTGGTTTAA 

TTTGGTTTAA 

TTTGGTTTAA 

TTTGGTTTAA 

TTTGGTTTAA 

TTTGGTTTAA 

TTTGGTTTAA 

TTTGGTTTAA 
********** 



CTGAAAGTCA 

CTGAAAGTCA 

CTGAAAGTCA 

CTGAAAGTCA 

CTGAAAGTCA 

CTGAAAGTCA 

CTGAAAGTCA 

CTGAAAGTCA 
********** 



GATAGATGGA 

GATAGATGGA 

GATAGATGGA 

GATAGATGGA 

GATAGATGGA 

GATAGATGGA 

GATAGATGGA 

GATAGATGGA 
********** 



1400 

TACTATATTT 

TACTATATTT 

TACTATATTT 

TACTATATTT 

TACTATATTT 

TACTATATTT 

TACTATATTT 

TACTATATTT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1401 

ATAAAGATGC 
ATAAAGATGC 
ATAAAGATGC 
ATAAAGATGC 
ATAAAGATGC 
ATAAAGATGC 
ATAAAGATGC 
ATAAAGATGC 



AATTAACCTT 
AATTAACCTT 
AATTAACCTT 
AATTAACCTT 
AATTAACCTT 
AATTAACCTT 
AATTAACCTT 
AATTAACCTT 



AAATTTAAAT 
AAATTTAAAT 
AAATTTAAAT 
AAATTTAAAT 
AAATTTAAAT 
AAATTTAAAT 
AAATTTAAAT 
AAATTTAAAT 



********** ********** ********** 



TAACCAGTGG 

TAACCAGTGG 

TAACCAGTGG 

TAACCAGTGG 

TAACCAGTGG 

TAACCAGTGG 

TAACCAGTGG 

TAACCAGTGG 
********** 



1450 

TGCAAGTCTT 

TGCAAGTCTT 

TGCAAGTCTT 

TGCAAGTCTT 

TGCAAGTCTT 

TGCAAGTCTT 

TGCAAGTCTT 

TGCAAGTCTT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1451 

AAAGTTGTTT 
AAAGTTGTTT 
AAAGTTGTTT 
AAAGTTGTTT 
AAAGTTGTTT 
AAAGTTGTTT 
AAAGTTGTTT 
AAAGTTGTTT 



ATAAAGGGCA 
ATAAAGGGCA 
ATAAAGGGCA 
ATAAAGGGCA 
ATAAAGGGCA 
ATAAAGGGCA 
ATAAAGGGCA 
ATAAAGGGCA 



AGAAGATCCA 
AGAAGATCCA 
AGAAGATCCA 
AGAAGATCCA 
AGAAGATCCA 
AGAAGATCCA 
AGAAGATCCA 
AGAAGATCCA 



********** ********** ********** 



TATAGTCATC 

TATAGTCATC 

TATAGTCATC 

TATAGTCATC 

TATAGTCATC 

TATAGTCATC 

TATAGTCATC 

TATAGTCATC 
********** 



1500 

AGAAAGAAGA 

AGAAAGAAGA 

AGAAAGAAGA 

AGAAAGAAGA 

AGAAAGAAGA 

AGAAAGAAGA 

AGAAAGAAGA 

AGAAAGAAGA 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1501 

TATGACTAAA 
TATGACTAAA 
TATGACTAAA 
TATGACTAAA 
TATGACTAAA 
TATGACTAAA 
TATGACTAAA 
TATGACTAAA 



AaAGGTGAAC 
AaAGGTGAAC 
AaAGGTGAAC 
AaAGGTGAAC 
AaAGGTGAAC 
AaAGGTGAAC 
AaAGGTGAAC 
AaAGGTGAAC 



AGCTCAGTCA 
AGCTCAGTCA 
AGCTCAGTCA 
AGCTCAGTCA 
AGCTCAGTCA 
AGCTCAGTCA 
AGCTCAGTCA 
AGCTCAGTCA 



TTCAACTCAA 
TTCAACTCAA 
TTCAACTCAA 
TTCAACTCAA 
TTCAACTCAA 
TTCAACTCAA 
TTCAACTCAA 
TTCAACTCAA 



1550 
GCCAATGAAA 
GCCAATGAAA 
GCCAATGAAA 
GCCAATGAAA 
GCCAATGAAA 
GCCAATGAAA 
GCCAATGAAA 
GCCAATGAAA 



********** *_******** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1551 

ATACAGCAAA 
ATACAGCAAA 
ATACAGCAAA 
ATACAGCAAA 
ATACAGCAAA 
ATACAGCAAA 
ATACAGCAAA 
ATACAGCAAA 



AGTAACCTTT 
AGTAACCTTT 
AGTAACCTTT 
AGTAACCTTT 
AGTAACCTTT 
AGTAACCTTT 
AGTAACCTTT 
AGTAACCTTT 



GCTAATATTG 
GCTAATATTG 
GCTAATATTG 
GCTAATATTG 
GCTAATATTG 
GCTAATATTG 
GCTAATATTG 
GCTAATATTG 



ACTGGTCACA 
ACTGGTCACA 
ACTGGTCACA 
ACTGGTCACA 
ACTGGTCACA 
ACTGGTCACA 
ACTGGTCACA 
ACTGGTCACA 



1600 
TTATAGTAAG 
TTATAGTAAG 
TTATAGTAAG 
TTATAGTAAG 
TTATAGTAAG 
TTATAGTAAG 
TTATAGTAAG 
TTATAGTAAG 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1601 

GTTACTGTGA 

GTTACTGTGA 

GTTACTGTGA 

GTTACTGTGA 

GTTACTGTGA 

GTTACTGTGA 

GTTACTGTGA 

GTTACTGTGA 
********** 



ATGGAAAAGA 

ATGGAAAAGA 

ATGGAAAAGA 

ATGGAAAAGA 

ATGGAAAAGA 

ATGGAAAAGA 

ATGGAAAAGA 

ATGGAAAAGA 
********** 



AGTTGgTAAA 

AGTTGgTAAA 

AGTTGgTAAA 

AGTTGgTAAA 

AGTTGgTAAA 

AGTTGtTAAA 

AGTTGtTAAA 

AGTTGgTAAA 
*****_**** ********** 



GGTAGTGAGT 
GGTAGTGAGT 
GGTAGTGAGT 
GGTAGTGAGT 
GGTAGTGAGT 
GGTAGTGAGT 
GGTAGTGAGT 
GGTAGTGAGT 



1650 

TACCTTTAAC 

TACCTTTAAC 

TACCTTTAAC 

TACCTTTAAC 

TACCTTTAAC 

TACCTTTAAC 

TACCTTTAAC 

TACCTTTAAC 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}



1651 

TAAAGGATGG 
TAAAGGATGG 
TAAAGGATGG 
TAAAGGATGG 
TAAAGGATGG 
TAAAGGATGG 



ACAACATTTG 
ACAACATTTG 
ACAACATTTG 
ACAACATTTG 
ACAACATTTG 
ACAACATTTG 



TATTACATAA 
TATTACATAA 
TATTACATAA 
TATTACATAA 
TATTACATAA 
TATTACATAA 



AACAGAAAAT 
AACAGAAAAT 
AACAGAAAAT 
AACAGAAAAT 
AACAGAAAAT 
AACAGAAAAT 



1700 
TCATTAAATG 
TCATTAAATG 
TCATTAAATG 
TCATTAAATG 
TCATTAAATG 
TCATTAAATG 






msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 

TAAAGGATGG ACAACATTTG TATTACATAA AACAGAAAAT TCATTAAATG 
********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



1701 

TTAAAAGTTT 

TTAAAAGTTT 

TTAAAAGTTT 

TTAAAAGTTT 

TTAAAAGTTT 

TTAAAAGTTT 

TTAAAAGTTT 

TTAAAAGTTT 
********** 



GATTATGGAG 

GATTATGGAG 

GATTATGGAG 

GATTATGGAG 

GATTATGGAG 

GATTATGGAG 

GATTATGGAG 

GATTATGGAG 
********** 



ACGGGTAGTG 

ACGGGTAGTG 

ACGGGTAGTG 

ACGGGTAGTG 

ACGGGTAGTG 

ACGGGTAGTG 

ACGGGTAGTG 

ACGGGTAGTG 
********** 



TAAGTAAGAA 
TAAGTAAGAA 
TAAGTAAGAA 
TAAGTAAGAA 
TAAGTAAGAA 
TAAGTAAGAA 
TAAGTAAGAA 
TAAGTAAGAA 



1750 
AGTTCAACAA 
AGTTCAACAA 
AGTTCAACAA 
AGTTCAACAA 
AGTTCAACAA 
AGTTCAACAA 
AGTTCAACAA 
AGTTCAACAA 



********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



1751 

CTTCCTTTAA 

CTTCCTTTAA 

CTTCCTTTAA 

CTTCCTTTAA 

CTTCCTTTAA 

CTTCCTTTAA 

CTTCCTTTAA 

CTTCCTTTAA 
********** 



GTCCTAGATT 
GTCCTAGATT 
GTCCTAGATT 
GTCCTAGATT 
GTCCTAGATT 
GTCCTAGATT 
GTCCTAGATT 
GTCCTAGATT 



ATCTAAAAAT 
ATCTAAAAAT 
ATCTAAAAAT 
ATCTAAAAAT 
ATCTAAAAAT 
ATCTAAAAAT 
ATCTAAAAAT 
ATCTAAAAAT 



********** ********** 



AAGCATATGA 

AAGCATATGA 

AAGCATATGA 

AAGCATATGA 

AAGCATATGA 

AAGCATATGA 

AAGCATATGA 

AAGCATATGA 
********** 



1800 

GGGATATGCT 

GGGATATGCT 

GGGATATGCT 

GGGATATGCT 

GGGATATGCT 

GGGATATGCT 

GGGATATGCT 

GGGATATGCT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



1801 

ACTTACTATG 

ACTTACTATG 

ACTTACTATG 

ACTTACTATG 

ACTTACTATG 

ACTTACTATG 

ACTTACTATG 

ACTTACTATG 
********** 



CAAAAAGATT 

CAAAAAGATT 

CAAAAAGATT 

CAAAAAGATT 

CAAAAAGATT 

CAAAAAGATT 

CAAAAAGATT 

CAAAAAGATT 
********** 



CAGCGTATTA 

CAGCGTATTA 

CAGCGTATTA 

CAGCGTATTA 

CAGCGTATTA 

CAGCGTATTA 

CAGCGTATTA 

CAGCGTATTA 
********** 



CGAAACAAGT 

CGAAACAAGT 

CGAAACAAGT 

CGAAACAAGT 

CGAAACAAGT 

CGAAACAAGT 

CGAAACAAGT 

CGAAACAAGT 
********** 



1850 

GACAGTCTAG 

GACAGTCTAG 

GACAGTCTAG 

GACAGTCTAG 

GACAGTCTAG 

GACAGTCTAG 

GACAGTCTAG 

GACAGTCTAG 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



1851 

TCCTTCGAAT 

TCCTTCGAAT 

TCCTTCGAAT 

TCCTTCGAAT 

TCCTTCGAAT 

TCCTTCGAAT 

TCCTTCGAAT 

TCCTTCGAAT 
********** 



TAATCTCACT 

TAATCTCACT 

TAATCTCACT 

TAATCTCACT 

TAATCTCACT 

TAATCTCACT 

TAATCTCACT 

TAATCTCACT 
********** 



GCAGATACTA 

GCAGATACTA 

GCAGATACTA 

GCAGATACTA 

GCAGATACTA 

GCAGATACTA 

GCAGATACTA 

GCAGATACTA 
********** 



AACTTAATTT 

AACTTAATTT 

AACTTAATTT 

AACTTAATTT 

AACTTAATTT 

AACTTAATTT 

AACTTAATTT 

AACTTAATTT 
********** 



1900 

TAATGCTGTT 

TAATGCTGTT 

TAATGCTGTT 

TAATGCTGTT 

TAATGCTGTT 

TAATGCTGTT 

TAATGCTGTT 

TAATGCTGTT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



1901 

AAAGGAGCGA 

AAAGGAGCGA 

AAAGGAGCGA 

AAAGGAGCGA 

AAAGGAGCGA 

AAAGGAGCGA 

AAAGGAGCGA 

AAAGGAGCGA 
********** 



GTGCTCTTAC 

GTGCTCTTAC 

GTGCTCTTAC 

GTGCTCTTAC 

GTGCTCTTAC 

GTGCTCTTAC 

GTGCTCTTAC 

GTGCTCTTAC 
********** 



TGAAAATATG 

TGAAAATATG 

TGAAAATATG 

TGAAAATATG 

TGAAAATATG 

TGAAAATATG 

TGAAAATATG 

TGAAAATATG 
********** 



ATGATGAGAC 
ATGATGAGAC 
ATGATGAGAC 
ATGATGAGAC 
ATGATGAGAC 
ATGATGAGAC 
ATGATGAGAC 

ATGATGAGAC 
********** 



1950 

AGTTTGCAGT 

AGTTTGCAGT 

AGTTTGCAGT 

AGTTTGCAGT 

AGTTTGCAGT 

AGTTTGCAGT 

AGTTTGCAGT 

AGTTTGCAGT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



1951 

TGCTGGACCA 
TGCTGGACCA 
TGCTGGACCA 
TGCTGGACCA 
TGCTGGACCA 
TGCTGGACCA 
TGCTGGACCA 
TGCTGGACCA 



CAAGATGATC 
CAAGATGATC 
CAAGATGATC 
CAAGATGATC 
CAAGATGATC 
CAAGATGATC 
CAAGATGATC 
CAAGATGATC 



********** **********



CTGTTAGTGA 
CTGTTAGTGA 
CTGTTAGTGA 
CTGTTAGTGA 
CTGTTAGTGA 
CTGTTAGTGA 
CTGTTAGTGA 

CTGTTAGTGA 
**********



ACATAAATAC 

ACATAAATAC 

ACATAAATAC 

ACATAAATAC 

ACATAAATAC 

ACATAAATAC 

ACATAAATAC 

ACATAAATAC 
********** 



2000 

CCATCAGTAT 

CCATCAGTAT 

CCATCAGTAT 

CCATCAGTAT 

CCATCAGTAT 

CCATCAGTAT 

CCATCAGTAT 

CCATCAGTAT 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



2001 

TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 
TTCTCTTAAC 



TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 
TCCTGCCTTA 



********** ********** 



TTGGAAACTG 

TTGGAAACTG 

TTGGAAACTG 

TTGGAAACTG 

TTGGAAACTG 

TTGGAAACTG 

TTGGAAACTG 

TTGGAAACTG 
********** 



CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
CTAGTGAGGC 
********** 



2050 

AACTCTAAAT 

AACTCTAAAT 

AACTCTAAAT 

AACTCTAAAT 

AACTCTAAAT 

AACTCTAAAT 

AACTCTAAAT 

AACTCTAAAT 
********** 



msa235280.2{195_COH1}



2051 2100 
GGTAAGGAAA TCACAGCATC TGGTATTATC GGTCACATCA AGGATGGTGA 






msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 
GGTAAGGAAA 



TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 
TCACAGCATC 



TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 
TGGTATTATC 



GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 
GGTCACATCA 



AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 
AGGATGGTGA 



********** ********** ********** ********** ********** 



2101 

TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 
TAAAAGCAAG 



CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 
CATGTTGAAG 



TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 
TCAAAATGGT 



GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 
GAATGAAAAT 



********** ********** ********** ********** 



2251 

CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 
CCGGGCTAAA 



2301 

TGGTAACAGA 

TGGTAACAGA 

TGGTAACAGA 

TGGTAACAGA 

TGGTAACAGA 

TGGTAACAGA 

TGGTAACAGA 

TGGTAACAGA 
********** 

2351 

GACCAATCAG 

GACCAATCAG 

GACCAATCAG 

GACCAATCAG 

GACCAATCAG 

GACCAATCAG 

GACCAATCAG 

GACCAATCAG 
********** 



2150 

GGAGACATGC 

GGAGACATGC 

GGAGACATGC 

GGAGACATGC 

GGAGACATGC 

GGAGACATGC 

GGAGACATGC 

GGAGACATGC 
********** 

2200 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
TCGAACAAAA 
********** 

2250 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 
AATATGAGTT 

AATATGAGTT 
********** 

2300 

AGGGTTGAAG 

AGGGTTGAAG 

AGGGTTGAAG 

AGGGTTGAAG 

AGGGTTGAAG 

AGGGTTGAAG 

AGGGTTGAAG 

AGGGTTGAAG 
********** 

2350 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 
CATGTTCTTT 

********** ********** ********** ********** 



2151 

TAGGAACCCC 

TAGGAACCCC 

TAGGAACCCC 

TAGGAACCCC 

TAGGAACCCC 

TAGGAACCCC 

TAGGAACCCC 

TAGGAACCCC 
********** 

2201 

CCATTAATGA 
CCATTAATGA 
CCATTAATGA 
CCATTAATGA 
CCATTAATGA 
CCATTAATGA 
CCATTAATGA 
CCATTAATGA 

********** ********** ********** ********** 



TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
TGTTATTATT 
********** 



GTGGACGTAG 
GTGGACGTAG 
GTGGACGTAG 
GTGGACGTAG 
GTGGACGTAG 
GTGGACGTAG 
GTGGACGTAG 
GTGGACGTAG 



CAAGGTAAAG 

CAAGGTAAAG 

CAAGGTAAAG 

CAAGGTAAAG 

CAAGGTAAAG 

CAAGGTAAAG 

CAAGGTAAAG 

CAAGGTAAAG 
********** 



AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 
AGTACTTTAT 



ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
ACTTGACTAA 
********** 



GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 
GCCGGTAAAC 



TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 
TTACCACTTA 



GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 
GTCGTTTTAA 



********** ********** ********** 



AGCAGGAGAG
AGCAGGAGAG 
AGCAGGAGAG 
AGCAGGAGAG 
AGCAGGAGAG 
AGCAGGAGAG 
AGCAGGAGAG 
AGCAGGAGAG 



AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 
AAAGCAAGTA 



CACTTGGATT 

CACTTGGATT 

CACTTGGATT 

CACTTGGATT 

CACTTGGATT 

CACTTGGATT 

CACTTGGATT 

CACTTGGATT 
********** 



TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 
TTGTTCGTCG 



TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 
TTCCAGAGCT 



TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 
TAACACAGCA 



GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 
GTTGCTAAAC 



2400 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 
GTGATTTGAC 



********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}



2401 

TTCTGATACT 
TTCTGATACT 
TTCTGATACT 
TTCTGATACT 
TTCTGATACT 
TTCTGATACT 
TTCTGATACT 



GCTCTTATCC 
GCTCTTATCC 
GCTCTTATCC 
GCTCTTATCC 
GCTCTTATCC 
GCTCTTATCC 
GCTCTTATCC 



ACATCGTTGC 
ACATCGTTGC 
ACATCGTTGC 
ACATCGTTGC 
ACATCGTTGC 
ACATCGTTGC 
ACATCGTTGC 



CAAAGATGAC 
CAAAGATGAC 
CAAAGATGAC 
CAAAGATGAC 
CAAAGATGAC 
CAAAGATGAC 
CAAAGATGAC 



2450 
TCTCTAAAAC 
TCTCTAAAAC 
TCTCTAAAAC 
TCTCTAAAAC 
TCTCTAAAAC 
TCTCTAAAAC 
TCTCTAAAAC 



851 



WO 2004/018646 



PCT/US2003/026827 



Table 51: Comparative Sequences relating to SAG0677 



msa235280.2{195_A909}

Consensus 



TTCTGATACT 
********** 



GCTCTTATCC 
********** 



ACATCGTTGC 
********** 



CAAAGATGAC 
********** 



TCTCTAAAAC 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



2451 

TAAAATTATA 

TAAAATTATA 

TAAAATTATA 

TAAAATTATA 

TAAAATTATA 

TAAAATTATA 

TAAAATTATA 

TAAAATTATA 
********** 



TCAAGATGAT 
TCAAGATGAT 
TCAAGATGAT 
TCAAGATGAT 
TCAAGATGAT 
TCAAGATGAT 
TCAAGATGAT 
TCAAGATGAT 



TCATTACTTG 
TCATTACTTG 
TCATTACTTG 
TCATTACTTG 
TCATTACTTG 
TCATTACTTG 
TCATTACTTG 
TCATTACTTG 



AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 
AATCTGTTGA 



2500 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 
TAAAACCGGT 



********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



2501 

CTTTATAGTT 
CTTTATAGTT 
CTTTATAGTT 
CTTTATAGTT 
CTTTATAGTT 
CTTTATAGTT 
CTTTATAGTT 
CTTTATAGTT 



TTAGAAATGG 
TTAGAAATGG 
TTAGAAATGG 
TTAGAAATGG 
TTAGAAATGG 
TTAGAAATGG 
TTAGAAATGG 
TTAGAAATGG 



********** ********** 



2551 

ACTAGAATTT 
ACTAGAATTT 
ACTAGAATTT 
ACTAGAATTT 
ACTAGAATTT 
ACTAGAATTT 
ACTAGAATTT 

ACTAGAATTT 
********** 



GGAGATAATA 

GGAGATAATA 

GGAGATAATA 

GGAGATAATA 

GGAGATAATA 

GGAGATAATA 

GGAGATAATA 

GGAGATAATA 
********** 



TGTAGAAATC 

TGTAGAAATC 

TGTAGAAATC 

TGTAGAAATC 

TGTAGAAATC 

TGTAGAAATC 

TGTAGAAATC 

TGTAGAAATC 
********** 



TTAtTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 
TTAcTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 
TTAtTAAGTT 

TTAtTAAGTT 
***_****** 



ACTAAAGATA 

ACTAAAGATA 

ACTAAAGATA 

ACTAAAGATA 

ACTAAAGATA 

ACTAAAGATA 

ACTAAAGATA 

ACTAAAGATA 
********** 



ATCTGCTGTT 

ATCTGCTGTT 

ATCTGCTGTT 

ATCTGCTGTT 

ATCTGCTGTT 

ATCTGCTGTT 

ATCTGCTGTT 

ATCTGCTGTT 
********** 



2550 
TGACAGTACC 
TGACAGTACC 
TGACAGTACC 
TGACAGTACC 
TGACAGTACC 
TGACAGTACC 
TGACAGTACC 
TGACAGTACC 
********** 

2600 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 
GACTTATCAA 

GACTTATCAA 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



2601 

ATTATCGTCG 
ATTATCGTCG 
ATTATCGTCG 
ATTATCGTCG 
ATTATCGTCG 
ATTATCGTCG 
ATTATCGTCG 
ATTATCGTCG 



TAATGAGACC 
TAATGAGACC 
TAATGAGACC 
TAATGAGACC 
TAATGAGACC 
TAATGAGACC 
TAATGAGACC 
TAATGAGACC 



CTTCATATCT 
CTTCATATCT 
CTTCATATCT 
CTTCATATCT 
CTTCATATCT 
CTTCATATCT 
CTTCATATCT 
CTTCATATCT 



ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 
ATAGAAACCG 



2650 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 
TTTTGATGTT 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



2651 

AAAGCAAGCC 
AAAGCAAGCC 
AAAGCAAGCC 
AAAGCAAGCC 
AAAGCAAGCC 
AAAGCAAGCC 
AAAGCAAGCC 
AAAGCAAGCC 



AAATGACAGC 
AAATGACAGC 
AAATGACAGC 
AAATGACAGC 
AAATGACAGC 
AAATGACAGC 
AAATGACAGC 
AAATGACAGC 



TGACAAAGGA 
TGACAAAGGA 
TGACAAAGGA 
TGACAAAGGA 
TGACAAAGGA 
TGACAAAGGA 
TGACAAAGGA 
TGACAAAGGA 



GCTAAAGTAA 
GCTAAAGTAA 
GCTAAAGTAA 
GCTAAAGTAA 
GCTAAAGTAA 
GCTAAAGTAA 
GCTAAAGTAA 
GCTAAAGTAA 



2700 
CTGTGGATAT 
CTGTGGATAT 
CTGTGGATAT 
CTGTGGATAT 
CTGTGGATAT 
CTGTGGATAT 
CTGTGGATAT 
CTGTGGATAT 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus



2701 

GTTGATGAAG 
GTTGATGAAG 
GTTGATGAAG 
GTTGATGAAG 
GTTGATGAAG 
GTTGATGAAG 
GTTGATGAAG 
GTTGATGAAG 



CACTTAGTTG 
CACTTAGTTG 
CACTTAGTTG 
CACTTAGTTG 
CACTTAGTTG 
CACTTAGTTG 
CACTTAGTTG 
CACTTAGTTG 



TTCCAGAAAT 
TTCCAGAAAT 
TTCCAGAAAT 
TTCCAGAAAT 
TTCCAGAAAT 
TTCCAGAAAT 
TTCCAGAAAT 
TTCCAGAAAT 



GGCAGGAGCT 
GGCAGGAGCT 
GGCAGGAGCT 
GGCAGGAGCT 
GGCAGGAGCT 
GGCAGGAGCT 
GGCAGGAGCT 
GGCAGGAGCT 



2750 
TATACATTAA 
TATACATTAA 
TATACATTAA 
TATACATTAA 
TATACATTAA 
TATACATTAA 
TATACATTAA 
TATACATTAA 



********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}



2751 

CAATCGACGA 
CAATCGACGA 
CAATCGACGA 
CAATCGACGA 
CAATCGACGA 
CAATCGACGA 
CAATCGACGA 
CAATCGACGA 



AGcTCCAAAC 
AGcTCCAAAC 
AGcTCCAAAC 
AGcTCCAAAC 
AGcTCCAAAC 
AGcTCCAAAC 
AGcTCCAAAC 
AGaTCCAAAC 



ACAAATGAAT 
ACAAATGAAT 
ACAAATGAAT 
ACAAATGAAT 
ACAAATGAAT 
ACAAATGAAT 
ACAAATGAAT 
ACAAATGAAT 



CAGGAATGTT 
CAGGAATGTT 
CAGGAATGTT 
CAGGAATGTT 
CAGGAATGTT 
CAGGAATGTT 
CAGGAATGTT 
CAGGAATGTT 



********** **_******* ********** ********** 



2800 

AACAAACGCT 

AACAAACGCT 

AACAAACGCT 

AACAAACGCT 

AACAAACGCT 

AACAAACGCT 

AACAAACGCT 

AACAAACGCT 
********** 



2801 2850 
AAAGTATCGA TTCATTATGT AAATGGTGGT GTTGATAAAG TTGATGTTCC 
AAAGTATCGA TTCATTATGT AAATGGTGGT GTTGATAAAG TTGATGTTCC 






msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 
AAAGTATCGA 



TTCATTATGT 
TTCATTATGT 
TTCATTATGT 
TTCATTATGT 
TTCATTATGT 
TTCATTATGT 



AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 
AAATGGTGGT 



GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 
GTTGATAAAG 



TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 
TTGATGTTCC 



********** ********** ********** ********** ********** 



2851 

GATTAAAGTA 

GATTAAAGTA 

GATTAAAGTA 

GATTAAAGTA 

GATTAAAGTA 

GATTAAAGTA 

GATTAAAGTA 

GATTAAAGTA 
********** 



GTTGACTTAG 

GTTGACTTAG 

GTTGACTTAG 

GTTGACTTAG 

GTTGACTTAG 

GTTGACTTAG 

GTTGACTTAG 

GTTGACTTAG 
********** 



2900 

AAGCTATTcg taaagctgaa gaagcacata 
AAGCTATTcg taaagctgaa gaagcacata 
AAGCTATTcg taaagctgaa gaagcacata 
AAGCTATTcg taaagctgaa gaagcacata 
AAGCTATTcg taaagctgaa gaagcacata 

AAGCTATT 

AAGCTATT 

AAGCTATTcg taaagctgaa gaagcacata 
******** 



2901 

msa235280.2{195_COH1} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC
msa235280.2{195_M732} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC
msa235280.2{195_M781} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC
msa235280.2{195_H36B} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC
msa235280.2{195_JM9130013} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC
msa235280.2{195_18RS21} CGTAAAGC
msa235280.2{195_2603} . .* cgtaaa gctgaagaag caCGTAAAGC
msa235280.2{195_A909} aagctgacga agcacgtaaa gctgaagaag caCGTAAAGC

Consensus __******** 



2950 

TGAaGAAGCA 

TGAaGAAGCA 

TGAaGAAGCA 

TGAcGAAGCA 

TGAaGAAGCA 

TGAaGAAGCA 

TGAaGAAGCA 

TGAaGAAGCA 
***_****** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus

msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus

msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus

msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus



2951 

CaTAAAGCTG 

CaTAAAGCTG 

CaTAAAGCTG 

CaTAAAGCTG 

CaTAAAGCTG 

CgTAAAGCTG 

CgTAAAGCTG 

CgTAAAGCTG 
********** 

3001 

agca . CGTAA 

agca . CGTAA 

agcacCGTAA 

agca . CGTAA 

agcacCGTAA 

CGTAA 

CGTAA 

CGTAA 

***** 

3051 

GAAGAAGGCT 

GAAGAAGGCT 

GAAGAAGGCT 

GAAGAAGGCT 

GAAGAAGGCT 

GAAGAAGGCT 

GAAGAAGGCT 

GAAGAAGGCT 
********** 

3101 

AGCGTCTGAT 
AGCGTCTGAT 
AGCGTCTGAT 
AGCGTCTGAT 
AGCGTCTGAT 
AGCGTCTGAT 
AGCGTCTGAT 
AGCGTCTGAT 



3000 

AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 

AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 

AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 

AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 

AAGAAGtAcg taaagctgaa gaagcacata aagtcgaaga 

AAGAAGcA 

AAGAAGcA 

AAGAAGcA 

******_*__ — _ — _______ — — ______ 



AGCTGAAGAG 

AGCTGAAGAG 

AGCTGAAGAG 

AGCTGAAGAG 

AGCTGAAGAG 

AGCTGAAGAG 

AGCTGAAGAG 

AGCTGAAGAG 
********** 



ACAAaGTTAA 
ACAAaGTTAA 
ACAAaGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
ACAAgGTTAA 
****_***** 



TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
TTACCAAAGA 
********** ********** 



GGACATAAAA 

GGACATAAAA 

GGACATAAAA 

GGACATAAAA 

GGACATAAAA 

GGACATAAAA 

GGACATAAAA 

GGACATAAAA 
********** 



TAACGTTCAT

TAACGTTCAT

TAACGTTCAT

TAACGTTCAT

TAACGTTCAT

TAACGTTCAT

TAACGTTCAT 

TAACGTTCAT 
********** 



CTAAGACAGT 

CTAAGACAGT 

CTAAGACAGT 

CTAAGACAGT 

CTAAGACAGT 

CTAAGACAGT 

CTAAGACAGT 

CTAAGACAGT 
********** 



3050

ACCTATAGTT 

ACCTATAGTT 

ACCTATAGTT 

ACCTATAGTT 

ACCTATAGTT 

ACCTATAGTT 

ACCTATAGTT 

ACCTATAGTT 
********** 

3100 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 
CTACAGTTAA 

CTACAGTTAA 
********** ********** 



CCCAAGAAGC 

CCCAAGAAGC 

CCCAAGAAGC 

CCCAAGAAGC 

CCCAAGAAGC 

CCCAAGAAGC 

CCCAAGAAGC 

CCCAAGAAGC 
********** 



CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 
CAAACTGATA 



TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
TTCCGCAGTT 
********** 



3150 

CATATGGCTA 

CATATGGCTA 

CATATGGCTA 

CATATGGCTA 

CATATGGCTA 

CATATGGCTA 

CATATGGCTA 

CATATGGCTA 
********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}



3151 

GAACAGACAA 
GAACAGACAA 
GAACAGACAA 
GAACAGACAA 
GAACAGACAA 
GAACAGACAA 
GAACAGACAA 
GAACAGACAA 



TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 
TAAACAGATA 



ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 
ACTTCACATC 



3200 

AGACACATG t 

AGACACATGt TGAAAA 

AGACACATG t TG 

AGACACATG- 

AGACACATGt TG ~ 

AGACACATGt TGAA 

AGACACATGt TGAAAAACAA 
AGACACATGt TGAAAAACAA 






Consensus 



********** ********** ********** *********- 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



3201 



3250 



ATTAAAAATA cattgccatc cactggtgac agcaaacgtg gttattatat 

ATTAAAAATA 

********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}

Consensus 



3251 



3300 



cactggaatg gctatcgtta tgctgagtgt attatttagt ttagctaaaa 
********** ********** ********** ********** ********** 



msa235280.2{195_COH1}
msa235280.2{195_M732}
msa235280.2{195_M781}
msa235280.2{195_H36B}
msa235280.2{195_JM9130013}
msa235280.2{195_18RS21}
msa235280.2{195_2603}
msa235280.2{195_A909}
Consensus 



3301 



3317 



agtttaaaag caaatat 



********** ******* 



SEQ ID NO. 5110 
STRAIN 2603 frame: 1 

LNNKG VGGDGVQ I YQYY I KMDNNKP YL S P KD KTTVE KLEDRWKKI TFKVQDTGI GLKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKK^KPKLDRPPGIDLPAPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEAPKDSKKDAIEDKSGAIKYAKSLQLSFVIX3PILASKV 
NGKI LQVESDGKLVI PRNALSANQFDDTS LKI YRNNNRNKEI T I TTD YFADTKYVNI TAV 
DYLSNTTFEQLATGETVD YHAI VFS SFAAI KDK1GGKI YVNDKXQETS RI ALKDKS VKI G I 
ELPNDVRHIDSLSVRRIjNEVKTVDNILKNDEQDINLSKTYQLKY^ 

SSEI>n^FKDGKMPELVEQKDVSLDI^MDMSKFKTIRI^RKDSEFKGQLIAKTGTVELD 
MFFKQSQDPAS 1 1 KKI YL I QNGVPNELKKFDS S FGLTESQI DGYYIYKDAINLKFKLTSG 
ASLKVVYKGQEDPYSHQKEDMTKKGEQLSHSTQANE^ 

WKGSELPLTKGWTTFVLHKTENSLNVKSLIMETGSVSKKVQQLPLSPRLSKNKHMRDML 
LTMQKD S AYYET SD SLVLR I NLTADTKLNFNAWGAS ALT ENMMMRQ FAVAGPQDD P VS E 
HKYPSVFLLTPALLETASEATLNGKE I TASGI IGHI KLX3DKSKHVEVKMVNENGDMLGTP 
VI I QGKDLTNRTKPLMSGRRVL YAGKQYE FRAKLPLSRFNTW I RVEWTEAGEKAS I VRR 
MFFDQSVPELNTAVAKRDLTSDTAL IH I VAKDDSLKLKL YQDDSLLES VDKTGLYS FRNG 
VE I TKDMTVPLE FGDNI I KLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAKVTVDM 
LMKHLVVPEMAGAYTLTI DEAPNTNESGMLTNAKVS I HYVNGGVDKVDVPI KWDLEAI R 
KAEEARKAEEARKAEEARKAEEGHICrQEAP IVEEGYKVNNVHQTDTTVKASDLPKTKTVS 
AVHMARTDNKQITSHO/^HVEKQIKNTLPSTGDSKRGYYITG^^IVMLSVLFSLAKKFKSK 
Y 



SEQ ID NO. 5111 

STRAIN A909 frame: 1 

LNNKG VGGDGVQ I YQYY I KMDNNKP YL S P KDKTTVE KLEDRW KKI T F KVQDTG I GL KD VY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPPPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEALKDSKKDAIEDKSGAIKYAKSLQLSFVDDPILASKV 
NGKI LQVESDGKLVI PRNALSANQFDDTSLKI YRNNNRNKE I T I TTD YFADTKYVNI TAV 
DYLSNTTFEQLATGETVDYHAIVFSSFAAIKDKGGKIYVNDKLQETSRIALKDKSVKIGI 
ELPND VRH I DSLS VRRLNE VKTVDN I LKNDEQD I NLS KTYQLKYNPTNRRLEFT I NN INS 
SSEIMTTFKIX3KMPELVEQKDVSLDINDMDMSKFKTIRLGRKDSE^ 
MFFKQSQDPAS I IKKIYLIQNGVPNELKKFDSSFGLTESQIDGYYIYKDAINLKFKLTSG 
ASLKW!fKGQEDPYSHQKEDMTKKGEQLSHSTQANENTAK\T?FANIDWSHYSKVTVNGKE 
VGKGSELPLTKGWTTFVLJiKTENSLNVKSLIMETGSVSKKVQQLPLSPRLSKNKHMRDML 
LTMQ KDS AYY ET SD S LVLR I NLTADT KLN FNAVKGAS ALTENMMMRQ FAVAG P QDD P VS E 
HKYPSVFLLTPALLETASEATLNGKE I TASGI IGHI KDGDKS KHVEVKMVNENGDMLGTP 
VI I Q/3KDLTNRTKPLMSGRRVLYAGKQ YEFRAKLPLS RFNTW I RVEWTEAGEKAS I VRR 
MFFDQSVPELNTAVAKRDLTSDTAL I H I VAKDDSLKLKL YQDDSLLE SVDKTGLYSFRNG 
VE I TKDMTVPLE FGDNI I KLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAKVTVDM 
LMKHLWPEMAGAYTLTIDEDPNTNESGMLTNAKVS I HYVNGGVDKVDVP I KWDLEAI R 
KAEEAHKADEARKAEEARKAEEARKAEEARKAEEGHKTQEAP I VEEGYKVNNVHQTDTTV 
KASDLPKTKTVSAVHMARTDNKQ I TSHQTHVE KQ I KN 

SEQ ID NO. 5112 

STRAIN H36B frame: 2 

GVQIYQYYIKMD^WKPYLSPKD KTTVE KLEDRWKKITFKVQDTGIGLKDVYLQSVKYVGG 
GNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGTKPSKPKDSLS 
TPPGFPD1OTPPDEALKDSKKDAIEDKSGAIKYAKSLQLSFVDDPILASKVNGKILQVES 



DGKLVI PRNALSANQFDDTSLKI YRNNNRNKE IT ITTDYFADTKYVN I TAVDYLSNTTFE 
QLATGETVDYHAI VFSS FAAI KTDKGGKI YVNDKLQETSRI ALKDKS VKI G I ELPNDVRH I 
DSLS VRRLNEVKTVDNI LKNDEQD INLSKTYQLKYNPTNRRLEFT INNI NSSSE I MTTFK 
DGKMPELVEQKDVSLDINDMDMSKFKT I RLGRKDSEFKGQLI AKTGTVELDMFFKQSQDP 
ASIIKKIYLIQNGVPNELKKFDSSFGLTESQIIXSYYIYKDAI^njKFKLTSGASLKVVYKG 
QEDPYSHQKEDMTKKGEQLSHSTQANENTAKVTFANIDWSHYSKVTVNGKEVGKGSELPL 
TKGWTTFVLHKTENS LNVKS L IMETGS VS KKVQQLPLS PRLS KNKHMRDMLLTMQKDS AY 
YETSDSLVLRINLTADTKLNFNAVKGASALTENMMMRQFAVAGPQDDPVSEHKYPSVFLL 
TPALLETAS EATLNGKE I TASG IIGHI KDGDKSKHVEVKMVNENGDMLGTPVI I QGKDLT 
NRTKPLMSGRRVLYAGKQYEFRAKLPLSRFNTWI RVEWTEAGEKAS IVRRMFFDQSVPE 
LNTAVAKRDLTS DTAL I H I VAKDDSLKLKLYQDD SLLES VDKTGL YS FRNGVE I TKDMT V 
PIlEFGDNITKLSAVDLSNYRRNETLHIYRNRFDVKASQMTADKGAKVlTO^lmKHLVVPE 
MAGAYTLT I DEAPNTNE SGMLTNAKVS I H YVNGG VDKVD VP I KWDLEAI RKAEEAHKAD 
FJVRKAEEARKADEAHKAEEVRKAEEAHKVEEARKAEEGHKTQEAP I VEEGYKVNNVHQTD 
TTVKASDLPKTKTVSAVHMARTDNKQITSHQTH 

SEQ ID NO. 5113

STRAIN 18RS21 frame: 1 

LNNKG VGGDG VQ I YQ YY I KMDNNKP YLS PKDKTTVEKLEDRWKKI TFKVQDTG I GLKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEAPKDSKKDAI EDKSGAI KYAKSLQLSFVDDPI LASKV • 
NGKI LQVESDGKLVI PRNALSANQFDDTSLKI YRNNNRNKE I T I TTDYFADTKYVNITAV 
DYLSNTTFEQLATGETVDYHAIVFSSFAAI KDKGGKI YVNDKLQETSRI ALKDKS VKIGI 
ELPNDVRHI DSLS VRRLNEVKTVDNI LKNDEQD INLSKTYQLKYNPTNRRLEFT INN 
SSEIMTTFKDGKMPELVEQKDVSLDINDMDMSKFKTIRLGRKDSEFKGQLIAKTGTVELD 
MFFKQSQDPASIIKKIYLIQNGVPNELKKFDSSFGLTESQIDGYYIYKDAINLKFKLTSG 
ASLKVWKGQEDPYSHQKED>TrKKGEQLSHSTQANENTAKVTFANIDWSHYSKVTVNGKE 
VVKGSELPLTKGWTTFVLHKTENSIJWKSLIM 

LTMQ KD S AYYET SD S LVLR I NLTADTKLNFNAVKGAS ALTENMMM RQ FAVAG PQDDP VS E 
HKYPS VFLLTPALLETAS EATLNGKE I TASG I IGHI KDGDKSKHVEVKMVNENGDMLGTP 
VI IQX3KDLTNRTKPLMSGRRVLYAGKQYEFRAKLPLSRFN™ 

MFFDQSVPELNTAVAKRDLTSDTALIHIVAKDDSLKLKLYQDDSLLESVDKTGLYSFRNG 
VEITKDMTVPLEFGDNI I KLSAVDLSNYRRNETLHI YRNRFDVKASQ^ADKGAKVTVDM 
LMKHLWPEMAGAYTLT I DEAPNTNESGMLTNAKVS I HYVNGG VDKVD VP I KWDLEAI R 
KAEEARKAEEARKAEEGHKTQEAP I VEEGYKVNNVHQTDTTVKASDLPKTKT VS AVHMAR 
TDNKQITSHQTHVE 

SEQ ID NO. 5114

STRAIN M732 frame: 1 

LNNKG VGGDG VQ I YQYY I KMDNNKP YLS PKDKTTVEKLEDRWKKI TFKVQDTG I GLKDVY 
LQS VKYVGGGNNNLDLI TP PGFKKEDKKVE KPKLDRPPG I DLPAPTSMRS FD YST PPGTK 
PSKPKDSLSTPPGFPDLNTPPDEATKG . . KRRY . R . IRSN . I C . VSST . LC . . PYFS . QS 
KWQNITSRI .WQISHS.KCFVS.SI . .H.S.NLS. . . SQ. RNYYHNRLFCRYKICQYHSG 
. LFEQYYF . AISYW . NSRLPCHCI FKLCCY . RQGW . DLC . R . IARNFSYSA . R . I C . DWY 
.ITK.CQTY. .FICSSFE.G.NC. .YLEK. . TRH . SQQNLPI KIQPDKSS3RVYY . .H.L 
KFRNHDHFQRWKDARIG . TKRCFFGYKRYGHE . V . NYSTWTKGF . I . GTTYCKNWNS . IR 
YVFQTISRPSFNY . KNIPYPKWCSK. IEKI .L . FWFN . KSDRWILYL . RCN .P.I. INQW 
CKS.SCL.RARRSI .SSERRYD.KR.TAQSFNSSQ.KYSKSNLC. Y.LVTL. . GYCEWKR 
SW.R. . VTFN . RMDNICIT . NRKFIKC . KFDYGDG . CK. ESSTTSFKS .II- K-AYEGYA 
TYYAKRFSVLRNK. QSSPSN . SHCRY . T . F . CC . RSECSY . KYDDETVCS CWTTR . SC . . 
T . IPISISLNSCLIGNC . .GNSKW . GNHS I WYYRSHQGW . . KQAC . SQNGE . KWRHARNP 
CYYSR . RLD . SNKTINEWT . STLCR . TI . VPG . ITT . SF . HLD . G . SGNRSRRESKYCSS 
HVL . PISSRA .HSSC .T . FDF . YCSYPHRCQR . LSKTKI ISR . FIT . IC . . NRSL . F . KW 
CRNH . RYDSTTRI WR . YY . VI CC . LIKLSS . . DPSYL . KPF . C . SKPNDS . QRS . SNCGY 
VDEALSCSRNGRSLYINNRRSSKHK. IRNVNKR . SIDSLCKWWC . . S . CSD . SS . LRSYS 
. S . RST - S . RST . S .RST . S .RST . S . RST . S . RST . SRRST . S . RGT . NPRSTYS . RRL 
QS . . RSSN . YYS . SV . FTKD . DSFRSSYG . NRQ . TDNFTSDTC . K 

SEQ ID NO. 5115

STRAIN COH1 frame: 1

LNNKGVGGDGVQ I YQYY I KMDNNKP YLSPKDKTTVEKLEDRWKKITFKVQDTGI GLKDVY 
LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGTK 
PSKPKDSLSTPPGFPDLNTPPDEATKG . . KRRY . R . IRSN . I C . VSST -LC. . PYFS . QS 
KWQNITSRI .WQISHS.KCFVS.SI . .H.S .NLS. . . SQ .RNYYHNRLFCRYKICQYHSG 
. LFEQYYF . AI SYW . NSRLPCHCI FKLCCY . RQGW . DLC . R . IARNFSYSA . R . I C . DWY 
.ITK.CQTY. .FICSSFE.G.NC. .YLEK. . TRH . SQQNLPI KIQPDKSSSRVYY . .H.L 
KFRNHDHFQRWKDARIG . TKRCFFGYKRYGHE . V . NYSTWTKGF . I . GTTYCKNWNS . IR 
YVFQTISRPSFNY. KNIPYPKWCSK. IEKI . L . FWFN . KSDRWILYL . RCN . P . I . INQW 
CKS . SCL . RARRS I . SSERRYD . KR . TAQS FNSSQ . KYSKSNLC . Y - LVTL . . GYCEWKR 
SW.R. . VTFN . RMDNICIT . NRKFI KC . KFDYGDG . CK . ESSTTSFKS .U.K. AYEGYA 
TYYAKRFSVLRNK. QSSPSN . SHCRY . T . F . CC . RSECSY . KYDDETVCSCWTTR . SC . . 
T . I PI S I SLNSCLIGNC . . GNSKW . GNHS I WYYRSHQGW . . KQAC . SQNGE . KWRHARNP 
CYYSR . RLD . SNKTINEWT . STLCR . TI . VPG . ITT . SF . HLD . G . SGNRSRRESKYCSS 
HVL . PISSRA . HSSC .T . FDF . YCSYPHRCQR . LSKTKI I SR . FIT . IC . . NRSL . F . KW 
CRNH . RYDSTTRI WR . YY . VI CC . LIKLSS . . DPSYL . KPF . C . SKPNDS . QRS . SNCGY 
VDEALSCSRNGRSLYINNRRSSKHK. IRNVNKR .SIDSLCKWWC . . S . CSD . SS . LRSYS 
. S . RST . S . RST . S . RST . S . RST . S . RST . S . RST . SRRST . S . RGT . NPRSTYS . RRL 
QS . . RSSN . YYS . SV . FTKD . DSFRSSYG . NRQ . TDNFTSDTC 

SEQ ID NO. 5116

STRAIN M781 frame: 1 

LNNKGVGGDGVQ I YQYY I KMDNNKP YLS PKDKTTV^ 

LQSVKYVGGGNNNLDLITPPGFKKEDKKVEKPKLDRPPGI DLPAPTSMRS FDYSTPPGTK
PSKPKDSLSTPPGFPDLNTPPDEATKG . . KRRY . R . IRSN . IC . VSST . LC . . PYFS . QS 
KWQNITSRI .WQISHS.KCFVS.SI . .H.S.NLS. . . SQ . RNYYHNRLFCRYKI CQYHSG 
. LFEQYYF . AISYW . NSRLPCHCI FKLCCY . RQGW . DLC . R . IARNFSYSA . R . I C . DWY 
.ITK.CQTY. .FICSSFE.G.NC. . YLEK . . TRH . SQQNLPI KIQPDKSS SRVYY . .H.L 
KFRNHDHFQRWKDARIG . TKRCFFGYKRYGHE . V . NYSTWTKGF . I . GTTYCKNWNS . I R 
YVFQTISRPSFNY . KNIPYPKWCSK. IEKI . L . FWFN . KSDRWILYL'. RCN. P . I . INQW 
CKS . SCL . RARRSI . SSERRYD . KR .TAQSFNSSQ . KYSKSNLC . Y . LVTL . . GYCEWKR 
SW . R . . VTFN . RMDNICIT .NRKFIKC . KFDYGDG . CK . ESSTTSFKS .U.K. AYEGYA 
TYYAKRFSVLRNK. QSSPSN . SHCRY . T . F . CC . RSECSY . KYDDETVCSCWTTR . SC . . 
T.IPISISLNSCLIGNC. . GNSKW . GNHSIWYYRSHQGW . . KOAC . SQNGE . KWRHARNP 
CYYSR . RLD . SNKTtNEWT . STLCR.TI . VPG . ITT . SF .HLD . G . SGNRSRRESKYCSS 
HVL . PI SSRA . HSSC . T . FDF . YCS YPHRCQR . LSKTKI I SR . FI T . IC . . NRSL . F . KW 
CRNH . RYDSTTR I WR .YY.VICC. LI KLSS . . DPSYL . KPF . C . SKPNDS . QRS . SNCGY 
VDEALSCSRNGRSLYINNRRSSKHK . IRNVNKR . S IDSLCKWWC . . S . CSD . SS . LRSYS 
. S . RST . S . RST . S . RST . S . RST . S . RST . S . RST . SRRSTVKLKRDI KPKKHL . LKKA 
TKLITFIKLI LQLKRL I YQRLRQFPQF I WLEQT INR. LHI RHML 

SEQ ID NO. 5117 

STRAIN JM9130013 frame: 2 

GVQ I YQYYI KMDNNKPYLS PKDKTTVE KLEDRWKKITFKVQDTG I GLKDVYLQS VKYVGG 
GNNNLDL I TPPG FKKEDKKVEKPKIjDRPPG I DLPAPTSMRS FD YSTPPGTKPSKPKDSLS 
TPPGFPDLNTPPDEAPKDSKKDAI EDKSGAI KYAKSLQLS FVDDP I LAS KVNGKI LQVES 
DGKLVIPRNALSANQFDDTSLKIYRNNNRNKEITI TTDYFADTKYVNITAVDYLSNTTFE 
QLATGETVDYHAI VFSSFAAI KDKGGKI YVNDKLQETSRI ALKDKSVKIGI ELPNDVRHI 
DSLSVRRLNEVKTVDNILKNDEQDINIjSKTYQLKYNPTNRRIiEFT 

IX3KMPELVEQKDVSLDINDMDMSKFKTIRLGRKDSEFKGQLIAKTGTVELDMFFKQSQDP 
AS 1 1 KKI YL I QNGVPNELKKFDSS FGLTESQ I DGYY I YKDAI NLKFKLT S GASLKWYKG 
QEDPYSHQKEDMTKXGEQLSHSTQANENTAKVTFANIDWSHYSKVTVNGKEVGKGSELPL 
TKGWTTFVLHKTENSLNVKSLIMETGSVSKKVQQLPLSPRLSKNK^ 
YETSDSLVLRINLTADTKIaNFNAVKGASALTENMMMRQFAVAGPQDDPVSEHKYPSVFLL 

TPALLETASEATLNGKE ITASGI I GH I KDGDKSKHVEVKMVNENGDMLGTP VI I QGKDLT 
NRTKPLM S GRRVLYAGKQ Y EFRAKLPL SRFNTW I RVEWTEAGE KAS I VRRMFFDQSVPE 
IiNTAVAKRDLTSDTAL I HI VAKDDSLKLKLYQDDSLLiES VDKTGIiYS FRNGVE I TKDMTV 
PLEFGDNI IKLSAVDLSNYRRNETLHI YRNRFDVKASQMTADKGAKVTVDMLMKHLVVPE 
MAGAYTLT I D EAPNTNE S GMLTNAKVS I HYVNGG VDKVD VP I KWDLEA I RKAEEAHKAD 
EARKAEEARKAEEAHKAEEVRKAEEAHKVEEAP . S . RGT . NPRSTYS . RRLCG . . RSSN . 
YYS . SV . FTKD . DSFRSSYG . NRQ . TDNFTSDTC 

PRETTY of: /biotmp/msa235427.2{*} December 10, 2002 05:18



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



G 

G 

LNNKGVGGDG 

LNNKGVGGDG 

LNNKGVGGDG 

LNNKGVGGDG 

LNNKGVGGDG 

LNNKGVGGDG 
********** 



VQIYQYYIKM

VQIYQYYIKM

VQIYQYYIKM

VQIYQYYIKM

VQIYQYYIKM

VQIYQYYIKM

VQIYQYYIKM

VQIYQYYIKM
********** 



DNNKPYLSPK 

DNNKPYLSPK 

DNNKPYLSPK 

DNNKPYLSPK 

DNNKPYLSPK 

DNNKPYLSPK 

DNNKPYLSPK 

DNNKPYLSPK 
********** 



DKTTVEKLED 

DKTTVEKLED 

DKTTVEKLED 

DKTTVEKLED 

DKTTVEKLED 

DKTTVEKLED 

DKTTVEKLED 

DKTTVEKLED 
********** 



50 

RWKKITFKVQ 

RWKKITFKVQ 

RWKKITFKVQ 

RWKKITFKVQ 

RWKKITFKVQ 

RWKKITFKVQ 

RWKKITFKVQ 

RWKKITFKVQ 
********** 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



51 

DTGIGLKDVY
DTGIGLKDVY
DTGIGLKDVY
DTGIGLKDVY
DTGIGLKDVY
DTGIGLKDVY 
DTGIGLKDVY 

DTGIGLKDVY 
********** 



LQSVKYVGGG 

LQSVKYVGGG 

LQSVKYVGGG 

LQSVKYVGGG 

LQSVKYVGGG 

LQSVKYVGGG 

LQSVKYVGGG 

LQSVKYVGGG 
********** 



NNNLDLITPP 

NNNLDLITPP 

NNNLDLITPP 

NNNLDLITPP 

NNNLDLITPP 

NNNLDLITPP 

NNNLDLITPP 

NNNLDLITPP 
********** 



GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 
GFKKEDKKVE 

GFKKEDKKVE 
********** 



100 

KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 
KPKLDRPPGI 

KPKLDRPPGI 
********** 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}

Consensus 



101 

DLPaPTSMRS 
DLPaPTSMRS 
DLPaPTSMRS 
DLPaPTSMRS 
DLPpPTSMRS 
DLPaPTSMRS 
DLPaPTSMRS 
DLPaPTSMRS 



FDYSTPPGTK 
FDYSTPPGTK 
FDYSTPPGTK 
FDYSTPPGTK 
FDYSTPPGTK 
FDYSTPPGTK 
FDYSTPPGTK 
FDYSTPPGTK 



PSKPKDSLST 
PSKPKDSLST 
PSKPKDSLST 
PSKPKDSLST 
PSKPKDSLST 
PSKPKDSLST 
PSKPKDSLST 
PSKPKDSLST 



***_****** ********** ********** 



151 

daiedksgai
daiedksgai 
daiedksgai 
daiedksgai 
daiedksgai 
rry .r . irsn 
rry.r.irsn 
rry.r . irsn 



kyakslqlsf 
kyakslqlsf 
kyakslqlsf 
kyakslqlsf 
kyakslqlsf 
.ic.vsst .1 
. ic.vsst .1 
.ic.vsst .1 



vddPilaskv 
vddPilaskv 
vddPilaskv 
vdgPilaskv 
vddPilaskv 
c. .Pyf s .qs 
c. .Pyf s .qs 
c. .Pyf s .qs 



PPGFPDLNTP 

PPGFPDLNTP 

PPGFPDLNTP 

PPGFPDLNTP 

PPGFPDLNTP 

PPGFPDLNTP 

PPGFPDLNTP 

PPGFPDLNTP 
********** 



ngkilqvesd 
ngkilqvesd 
ngkilqvesd 
ngkilqvesd 
ngkilqvesd 
kwqnitsri . 
kwqnitsri - 
kwqnitsri . 



150 

PDEAlKdskK 

PDEApKdskK 

PDEApKdskK 

PDEApKdskK 

PDEAlKdskK 

PDEAtKg. .K 

PDEAtKg . . K 

PDEAtKg. .K 
****_* * 

200 

gklviprnal
gklviprnal
gklviprnal
gklviprnal
gklviprnal
wqishs.kcf
wqishs.kcf
wqishs.kcf

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus

msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}



201 

sanqf ddtsl 
sanqf ddtsl 
sanqf ddtsl 
sanqf ddtsl 
sanqf ddtsl 
vs . si . . h. s 
vs . si . .h.s 
vs. si. .h.s 



kiyrnnnrnk 
kiyrnnnrnk 
kiyrnnnrnk 
kiyrnnnrnk 
kiyrnnnrnk 
.nls . . . sq. 
. nls . . . sq . 
.nls . . . sq. 



eitittdyFa 
eitittdyFa 
eitittdyFa 
eitittdyFa 
eitittdyFa 
rnyyhnrlFc 
rnyyhnrlFc 
rnyyhnrlFc 



dtKyvnitav 
dtKyvnitav 
dtKyvnitav 
dtKyvnitav 
dtKyvnitav 
ryKicqyhsg 
ryKicqyhsg 
ryKicqyhsg 



250 

dylsnttFeq 
dylsnttFeq 
dylsnttFeq 
dylsnttFeq 
dylsnttFeq 
.If eqyyF .a 
.lfeqyyF.a 
. lfeqyyF.a 



251 

latgetvdyh 
latgetvdyh 
latgetvdyh 
latgetvdyh 
latgetvdyh 
isyw.nsrlp 
isyw.nsrlp 
isyw.nsrlp 



aivf ssfaai 
aivf ssf aai 
aivf ssfaai 
aivf ssf aai 
aivf ssf aai 
chcifklccy 
chcif klccy 
chcifklccy 



kdkGgkiyvn 
kdkGgkiyvn 
kdkGgkiyvn 
kdkGgkiyvn 
kdkGgkiyvn 
.rqGw.dlc . 
.rqGw.dlc . 
.rqGw.dlc . 



dklqetsria 
dklqetsria 
dklqetsria 
dklqetsria 
dklqetsria 
r.iarnfsys 
r .iarnf sys 
r.iarnfsys 



300 

lkdksvkigi 
lkdksvkigi 
lkdksvkigi 
lkdksvkigi 
lkdksvkigi 
a.r .ic .dwy 
a . r . ic . dwy 
a.r .ic .dwy 



301 

elpndvrhid 
elpndvrhid 
elpndvrhid 
elpndvrhid 
elpndvrhid 
. itk.cqty . 
. itk.cqty. 
. itk.cqty. 



slsvrrlnev 
slsvrrlnev 
slsvrrlnev 
slsvrrlnev 
slsvrrlnev 
. f icssfe.g 
.f icssfe.g 
. f icssf e .g 



ktvdniLknd 
ktvdniLknd 
ktvdniLknd 
ktvdniLknd 
ktvdniLknd 
.nc. .yLek. 
. nc . . yLek . 
.nc. .yLek. 



eqdinlskty 
eqdinlskty 
eqdinlskty 
eqdinlskty 
eqdinlskty 
. trh . sqqnl 
.trh.sqqnl 
. trh . sqqnl 



350 

qlKynPtnrr 
qlKynPtnrr 
qlKynPtnrr 
qlKynPtnrr 
qlKynPtnrr 
piKiqPdkss 
piKiqPdkss 
piKiqPdkss 



351 

lef tinnins 
lef tinnins 
lef tinnins 
lef tinnins 
lef tinnins 
srvyy . .h. 1 
srvyy. . h. 1 
srvyy. .h. 1 



sseimttFkd 
sseimttFkd 
sseimttFkd 
sseimttFkd 
sseimttFkd 
kfrnhdhFqr 
kf rnhdhFqr 
kfrnhdhFqr 



gKmpelveqK 

gKmpelveqK 

gKmpelveqK 

gKmpelveqK 

gKmpelveqK 

wKdarig . tK 

wKdarig.tK 

wKdarig . tK 
_*____ * 



dvsldindmd 
dvsldindmd 
dvsldindmd 
dvsldindmd 
dvsldindmd 
rcf fgykryg 
rcf fgykryg 
rcf fgykryg 



400 

mskf ktirlg 
mskf ktirlg 
mskfktirlg 
mskfktirlg 
mskfktirlg 
he . v.nystw 
he.v.nystw 
he .v.nystw 



401 

rKdsefkGql 
rKdsefkGql 
rKdsefkGql 
rKdsefkGql 
rKdsefkGql 
tKgf .i .Gtt 
tKgf .i.Gtt 
tKgf . i . Gtt 



iaKtgtveld 
iaKtgtveld 
iaKtgtveld 
iaKtgtveld 
iaKtgtveld 
ycKnwns . ir 
ycKnwns . ir 
ycKnwns . ir 



mf FkqsqdPa 
mf FkqsqdPa 
mf FkqsqdPa 
mf FkqsqdPa 
mf FkqsqdPa 
yvFqtisrPs 
yvFqtisrPs 
yvFqtisrPs 



siikKiyliq 
siikKiyliq 
siikKiyliq 
siikKiyliq 
siikKiyliq 
fny.Knipyp 
f ny . Knipyp 
f ny . Knipyp 



450 

ngvpnelkKf 
ngvpnelkKf 
ngvpnelkKf 
ngvpnelkKf 
ngvpnelkKf 
kwcsk . ieKi 
kwcsk. ieKi 
kwcsk. ieKi 



* _ 



451 

dssFgltesq 
dssFgltesq 
dssFgltesq 
dssFgltesq 
dssFgltesq 
. 1 . Fwf n.ks 
.l.Fwfn.ks 
. 1 .Fwf n.ks 



idgyyiykda 
idgyyiykda 
idgyyiykda 
idgyyiykda 
idgyyiykda 
drwilyl . rc 
drwilyl . rc 
drwilyl.rc 



inlkfkltsg
inlkfkltsg 
inlkfkltsg 
inlkfkltsg 
inlkfkltsg 
n . p . i . inqw 
n . p . i . inqw 
n . p . i . inqw 



aslkwykgq 
aslkwykgq 
aslkwykgq 
aslkwykgq 
aslkwykgq 
cks . scl . ra 
cks .scl.ra 
cks .scl .ra 



500 

edpyshqked 
edpyshqked 
edpyshqked 
edpyshqked 
edpyshqked 
rrsi . sserr 
rrsi . sserr 
rrsi . sserr 



501 

mtkkgeqlsh 
mtkxgeqlsh 
mtkkgeqlsh 
mtkkgeqlsh 
mtkkgeqlsh 
yd . kr . taqs 
yd . kr . taqs 
yd . kr . taqs 



stqanentaK 
stqanentaK 
stqanentaK 
stqanentaK 
stqanentaK 
fnssq.kysK 
fnssq.kysK 
fnssq.kysK 



vtfanidwsh 
vtfanidwsh
vtfanidwsh 
vtfanidwsh 
vtfanidwsh 
snlc .y. Ivt 
snlc.y.lvt 
snlc.y . Ivt 



yskvtvngKe 
yskvtvngKe 
yskvtvngKe 
yskvtvngKe 
yskvtvngKe 
1 • ■ gycewKr 
1 • • gycewKr 
1 . . gycewKr 



550 

vgkgselplt 
vgkgselplt 
vgkgselplt
vgkgselplt
vgkgselplt 
sw.r . .vtfn 
sw.r. .vtfn 
sw. r . .vtfn 



551 600

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 

kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 
WO 2004/018646
Table 51: Comparative Sequences relating to SAG0677



msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}

Consensus 



kgwttfvlhk tenslnvksl imetGsvskk vqqlplsprl sknkhmrdml 
.rmdnicit. nrkfikc.kf dygdG.ck.e ssttsf ks . i i.k.ayegy. 
.rmdnicit. nrkfikc.kf dygdG.ck.e ssttsfks.i i.k.ayegy. 
.rmdnicit. nrkfikc.kf dygdG.ck.e ssttsfks.i i.k.ayegy. 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



601 

lTmqkdsayy 
lTmqkdsayy 
lTmqkdsayy 
lTmqkdsayy 
lTmqkdsayy 
aTyyakrf sv 
aTyyakrf sv 
aTyyakrf sv 



etsdslvlri 
etsdslvlri 
etsdslvlri 
etsdslvlri 
etsdslvlri 
lrnk.qssps 
lrnk.qssps 
lrnk.qssps 



Nltadtklnf 
Nltadtklnf 
Nltadtklnf 
Nltadtklnf 
Nltadtklnf 
N . shcry . t . 
N . shcry . t . 
N . shcry . t . 



navkgaSalt 
navkgaSalt 
navkgaSalt 
navkgaSalt 
navkgaSalt 
f .cc .rSecs 
f .cc.rSecs 
f .cc.rSecs 



650 

enmmmrqf av 
enmmmrqf av 
enmmmrqf av 
enmmmrqf av 
enmmmrqf av 
y .kyddetvc 
y .kyddetvc 
y . kyddetvc 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



651 

agpqddpvse 
agpqddpvse 
agpqddpvse 
agpqddpvse 
agpqddpvse 
scwttr.sc. 
scwttr .sc . 
scwttr.sc . 



hkypsvf lit 
hkypsvf lit 
hkypsvf lit 
hkypsvf lit 
hkypsvf lit 
.t .ipisisl 
. t . ipisisl 
. t . ipisisl 



palLetasea 
palLetasea 
palLetasea 
palLetasea 
palLetasea 
nscLignc . . 
nscLignc. . 
nscLignc. . 



tlngkeitaS 
tlngkeitaS 
tlngkeitaS 
tlngkeitaS 
tlngkeitaS 
gnskw.gnhS 
gnskw.gnhS 
gnskw.gnhS 



700 

giighikdGd 
giighikdGd 
giighikdGd 
giighikdGd 
giighikdGd 
iwyyrshqGw 
iwyyrshqGw 
iwyyrshqGw 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



701 

ksKhvevkmv 
ksKhvevkmv 
ksKhvevkmv 
ksKhvevkmv 
ksKhvevkmv 
. . Kqac . sqn 
. .Kqac. sqn 
. . Kqac . sqn 



nEngdmlgtp 
nEngdmlgtp 
nEngdmlgtp 
nEngdmlgtp 
nEngdmlgtp 
gE . kwrharn 
gE . kwrharn 
gE . kwrharn 



viiqgkdltn 
viiqgkdltn 
viiqgkdltn 
viiqgkdltn 
viiqgkdltn 
pcyysr .rid 
pcyysr .rid 
pcyysr .rid 



rtkplmsgrr 
rtkplmsgrr 
rtkplmsgrr 
rtkplmsgrr 
rtkplmsgrr 
. snktinewt 
. snktinewt 
. snktinewt 



750 

vlyagkqyef 
vlyagkqyef 
vlyagkqyef 
vlyagkqyef 
vlyagkqyef 

• S t X GIT • t 1 m 

.stlcr . ti . 
.stlcr . ti . 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



751 

raklplsrfn
raklplsrfn 
raklplsrfn 
raklplsrfn 
raklplsrfn 
vpg. itt . sf 
vpg.itt .sf 
vpg. itt . sf 



twirvewte 
twirvewte 
twirvewte 
twirvewte 
twirvewte 
. hld.g. sgn 
. hid . g . sgn 
.hld.g. sgn 



agekaSivrr 
agekaSivrr 
agekaSivrr 
agekaSivrr 
agekaSivrr 
rsrreSkycs 
rsrreSkycs 
rsrreSkycs 



mf f dqsvpel 
mf f dqsvpel 
mf f dqsvpel 
mf f dqsvpel 
mf f dqsvpel 
shvl .pissr 
shvl .pissr 
shvl .pissr 



800 

ntavakrdlt 
ntavakrdlt 
ntavakrdlt 
ntavakrdlt 
ntavakrdlt 
a.hssc. t . f 
a.hssc. t . f 
a.hssc. t .f 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



801 

sdtalihiva 
sdtalihiva 
sdtalihiva 
sdtalihiva 
sdtalihiva 
df .ycsyphr 
df .ycsyphr 
df .ycsyphr 



kddsLklkly 
kddsLklkly 
kddsLklkly 
kddsLklkly 
kddsLklkly 
cqr .Lsktki 
cqr . Lsktki 
cqr .Lsktki 



qddsllesvd 
qddsllesvd 
qddsllesvd 
qddsllesvd 
qddsllesvd 
isr .f it . ic 
isr.fit .ic 
isr.f it .ic 



ktglysfrng 
ktglysfrng
ktglysfrng 
ktglysfrng 
ktglysfrng 
. .nrsl . f .k 
. .nrsl . f . k 
. .nrsl . f .k 



850 

veitkdmtvp 
veitkdmtvp 
veitkdmtvp 
veitkdmtvp 
veitkdmtvp 
wcrnh ♦ ryds 
wcrnh . ryds 
wcrnh . ryds 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



851 

lefgdnitkl 
lefgdniikl 
lefgdniikl 
lefgdniikl 
lefgdniikl 
ttriwr .yy. 
ttriwr .yy. 
ttriwr .yy. 



savdlsnyrr 
savdlsnyrr 
savdlsnyrr 
savdlsnyrr 
savdlsnyrr 
vice . likls 
vicc.likls 
vice. likls 



netlhiYrnr 
netlhiYrnr 
netlhiYrnr 
netlhiYrnr 
netlhiYrnr 
s . .dpsYl .k 
s . .dpsYl .k 
s. .dpsYl.k 



f dvkaSqmta 
f dvkaSqmta 
f dvkaSqmta 
f dvkaSqmta 
f dvkaSqmta 
pf . c . Skpnd 
pf . c . Skpnd 
pf . c . Skpnd 



900 

dkgakvtvdm 
dkgakvtvdm 
dkgakvtvdm 
dkgakvtvdm 
dkgakvtvdm 
s . qrs . sncg 
s . qrs . sncg 
s . qrs . sncg 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



901 

lmkhlwpem 
lmkhlwpem 
lmkhlwpem 
lmkhlwpem 
lmkhlwpem 
yvdealscsr 
yvdealscsr 
yvdealscsr 



aGaytltide 
aGaytltide 
aGaytltide 
aGaytltide 
aGaytltide 
nGrslyinnr 
nGrslyinnr 
nGrslyinnr 



apntnesgml 
apntnesgml 
apntnesgml 
apntnesgml 
dpntnesgml 
rsskhk. irn 
rsskhk. im 
rsskhk. irn 



tNakvSIhyv 

tNakvSIhyv 

tNakvSIhyv 

tNakvSIhyv 

tNakvSIhyv 

vNkr .SIdsl 

vNkr .SIdsl 

vNkr. SIdsl 
_ *___** 



950 

nggvdkvdvp 
nggvdkvdvp 
nggvdkvdvp 
nggvdkvdvp 
nggvdkvdvp 
ckwwc . . s . c 
ckwwc . . s . c 
ckwwc . . s . c 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



msa235427.2{195_H36B}
msa235427.2{195_JM9130013}
msa235427.2{195_18RS21}
msa235427.2{195_2603}
msa235427.2{195_A909}
msa235427.2{195_COH1}
msa235427.2{195_M732}
msa235427.2{195_M781}
Consensus 



951 

ikwdleair 
ikwdleair 
ikwdlea. . 
ikwdleair 
ikwdleair 
sd. ss . lrsy 
sd.ss .lrsy 
sd.ss.lrsy 



kaeeahkade 
kaeeahkade 
.... irkaee 
kaeearkaee 
kaeeahkade 
s . s . rst . s • 
s • s • rst . s . 
s . s • rst . s . 



arkaeearka 
arkaeearka 
arkaeearka 
arkaeearka 
arkaeearka 
rst . s .rst . 
rst . s .rst . 
rst . s .rst . 



deahkaeevr 
eeahkaeevr 
eeghktqeap 
eeghktqeap 
eearkaeear 
s .rst . s . rs 
s .rst . s . rs 
s.rst .s.rs 



1000 
kaeeahkvee 
kaeeahkvee 
iveegykvnn 
iveegykvnn 
kaeeghktqe 
t . s .rst . sr 
t . s .rst . sr 
t . s .rst . sr 



1001 

arkaeeghkt 
ap . s . rgt . n 
vhqtdttvka 
vhqtdttvka 
apiveegykv 
rst . s .rgt . 
rst .s. rgt. 
rstvklkrdi 



qeapiveegy 
prstys .rrl 
sdlpktktvs 
sdlpktktvs 
nnvhqtdttv 
nprstys . rr 
nprstys . rr 
kpkkhl . Ikk 



kvnnvhqtdt 
qg. .rssn.y 
avhmartdnk 
avhmartdnk 
kasdlpktkt 
Iqs . .rssn. 
Iqs. .rssn. 
atklitfikl 



tvkasdlpkt 
ys . sv. f tkd 
qitshqthve 
qitshqthve 
vsavhmartd 
yys .sv.f tk 
yys .sv.f tk 
ilqlkrliyq 



1050 
ktvsavhmar 
.dsfrssyg. 

kqikntlpst 
nkqitshqth 
d. dsfrssyg 
d. dsfrssyg 
rlrqfpqf iw 



1051 

tdnkqitshq th— 
nrq.tdnfts dtc- 



1081 



gdskrgyyit gmaivmlsvl fslakkfksk y 

vekqikn - 

.nrq . tdnf t sdTC 

.nrq.tdnft sdTC.k ~ 

leqtinr.lh irhml 

******** ********** * 
SEQ ID NO. 5201
STRAIN 090 

AGCX3ATACCTTTAATTTTGATATTGACCAAATTGCAGA 
CAATGCTATCACTAAAACAGATAAAACAACAGAAATTATTTCCAACCAGA 
CAAGAAGCCAAACTGGG CAAATTGCCTTTTTTGAAA 
CAAAAGTCTGCTATCTCTGAAAAAACACCAGCTTTGGTAGATACTTTTGT 
CGGCGATCAAAATGCGCTCCTTGATTTTGGACAATCCGCAGTAGAAGGCG 
TT AAT AC CACTGTTAAT CATAT CTTGTCTG AG CAG AAAAAAATTCAAATT 
CCTCAAGTTGATGATTTACTAAAAAATGCTAATCGCGAACTAAATGGATT 
TATTG CCAAATATAAAGATG CT ACTCCGG CAG AATTAgAGAAAAAAC CAA 
ACTTGATTCAAAAATTATTCAAACAAAGCAAGACCTCGCTACAGGAATTT 
TATTTTG ACT CACAAAACAT CGAG CAAAAAATGG ATATG ATGGC aGCGAA 
TGTTGTCAAACAAGAAGATACTTTGGCAAGAAATATCG t CTCTGCTGAAA 
TG CT CATTGAAGATAATACTAAAT CTATT GAAAAT TT GGTTGG AGTT ATT 
GCTfc t TATTGAATCgAGTCAAG CCGAGGCTGCTAATCG t GCAaGCCACTT 
ACAACAAGAAATT CTAG CATTAGAT AG CC aAACGT c CGAGTATCAAAT t A 
AAAGTaACCAATTAGCTCGAATGACTGAAGTTATCAATACCCTCGAACAG 
C^CATACTGAATATGTCAGCCGTCTOTACGTTGCATGGGCAACAACACC 
ACAGATGCGAAACTT GGTCAAAGT ATCGT CAGATATGCGT CAGAAACTTG 
GCATGTTACGTCGAAATACCATTCCAACAATGAAACTCTCAATCGCTCAG 
TTAGGCATGATGCAACAATCrrGTCAAATCCGGTGTCACTGCTGATGCTAT 
TGTCAACGCTAATAATG CAGCATTG CAGATG CTGG CTGAAACTAGTAAAG 
AAGCGATTCCa^TGTTAGAGAAGACCGCACAAAGCCCa^CTGTTT CTATT 
AAAT CTGTCA.CTGCA.TTAG CTGAAAG CTTAGTGG CTCAAAATAATGGTAT 
TATCGCTGCCATAGACAAAGGACGTAAGGAACGTGCCCaATTGGAATCTG 
CTGTTATTAAAT CGG CTGAAACAAT CAATGATTCTGT CAAAATT CGTGAT 
AAAAAAATAGTTGAAGC CTTACT CAACGAAGGT aAATCTAC C CAAGAAAA 
AGTTGATGAGTCT 

SEQ ID NO. 5202 
STRAIN A909 

AGCGATACCTTT AATTTTGATATTGAC CAAATTG CAGA 

CAATGCTAT CACTAAAACAGATAAAACAACAGAAATTATTTCCAAC CAGA 

CAACAAGCCAAACTGGGCAAATTGCCTTTT'TT 

CAAAAGTCTGCTATCTCTGAAAAAACACCAGCaTTGGTAGATACTTTTGT 
CGGTGACCAAAATGCGCTCCTTGATTTTGGACAATCCGCAGTAGAAGGCG 
TTAATAC CACTGTTAATCATAT CTTGT CTG AG CAGAAAAAAATTCAAATT 
C CTCAAGTTG ATGATTT ACTAAAAAATGCTAAT CG CX3AACTAAATGG ATT 
TATTG CCAAATATAAAGATG CTACT C CGG CAG AATTAGAGAAAAAAC CAA 
ACTTGATTCAAAAATTATTCAAACAAAGCAAGACCT CG CTACAGG AATTT 
TA1TTTGACTCACAAAACAT 

TGTTGTCAAACAAGAAGATACTTTGGCAAGAAATATCGTCTCTGCT 
TGCT CATTGAAGATAATACTAAAT CTATTGAAAATTTGGTTGGAGTTAwT 
GCTTTTATTGAATCGAGTCAAGCCGAGGCTGCC^TCGTGCAAGCCACTT 
ACAACAAGAAATTCT AG CATTAGATAG CCAAACGTC CGAGTATCAAATTA 
AAAGTAACCAATTAGCT CGAATGACTG AAGTTAT CAATACCCT CGAACAG 
CAACATACTGAATATGTCAGCCGTCTCTACGTTGCATGGGCAACAACACC 
ACAGATG CGAAACTTGGTCAAAGTATCGT CAGAT ATG CGTCAAAAACTTG 
GCATGTTACGTCGAAATACCATT C CAACaATGAAACT CT CAATCG CT CAG 
TTAGGCATGATGCAACAATCTX5TCAAATCCGGTGTCACTGCTGATGCTAT 
TGTCAACGCTAATAATG CAG CATTG CAGATG CTGG CTGAAACTAGTAAAG 
AAGCGATTCCGATGTTAGAGAAGAC CG CACAAAG C CC aACTGTTT CTATT 
AAATCTGTCACTGCATTAGCTGAAAGCTTAGTGGCTCAAAA 
TATCGCTGCCATAGACAAAGGACGTAAAGAACGTGCCCAATTAGAATCTG 
CTGTTATTAAAT CGG CTGAAACAATCAATGATTCTGTCAAAATTCGTGAT 
AAAAAAATAGTTGAAGCCTTACTCAACGAAGGTaAATCTACCCAAGAAAA 
AGtTGATGAGTCT 

SEQ ID NO. 5203 
STRAIN H36B

AG CG aTAC CTTT AATTTTGATATTGAC CAAATTG CAGAC 
AATGCTATCACTAAAACAGATAAAACAACAGAAATTATTTCCAACC^ 
AACAAGCCAAACTGGG CAAATTG CCTTTTTTGAAAAACT 
AAAAGTCTG CTAT CT CTGAAAAAACAC CAG CrTTGGTAGATACTTTTGTC 
GGTGACCAAAATGCGCTCCITGATTTTGGACAATCCGCAG 
TAATACCACTGTTAATCATATCTTGTCTGAGCAGAAAAAAATTCAAATTC 
CT CAAGTTGATGATTTACT AAAAAATG CT AAT CG CGAACTAAATGGATTT 
ATTG C CAAATATAAAGATG CTACTCCGG CAGAATTAGAGAAAAAACCAAA 
CTTGATT CAAAAATTATTCAAACAAAG CAAGACCTCG CTACAGGAATTTT 
ATTTTGACTCACAAAACATCGAGCAAAAAATGGATATGATGGCAGCGAAT 
GTTGTCAAACAAGAAGATACTTTGG CAAG AAAT ATCGT cTCTGCTGAAAT 
G CT CATTGAAGATAATACTAAATCT ATTG AAAATTTGGTTC 

ctt ttattgaatcgagtcaagccgaggctgccaatcgtgcaagccactta 
caacaagaaattctagcattagatag c caaacgt c cg agt at caaattaa 
aagtaacc^ttagctcgaatgactgaagttatcaataccctcgaacagc 
aacatactgaatatgtcagccgtctctacgttgcatgggcaacaacacca 
cagatgcgaaacttggtcaaagtat cgt cagat atg c gtcaaaaacttgg 
catgttacgt cgaaatac catt c caac aatgaaact ct caat cg ctcagt 
tagg catgatgcaacaat ctgt caaat ccggtgt cactg ctgatg ctatt 
gtcaagg ctaataatgcagcattg cag atg ctggctg aaactagt aaaga 
agcgattccgatgttagagaagaccgc7vcaaagccccactgtttctatta 
aatctgtcactgcattatctgaaagcttagtggctcaaaata^ 

ATCGCTGCCA,TAGACAAAGGACGTAAAGAACGTGCCCAATTAGAATCTGC 
TGTTATT AAATCGG CTGAAACAAT CAATG ATT CTGT CAAAATT CGTG AT a 
AAAAAATAGTTG AAG CCTT ACT CAa CGAAGGT aAAT CT AC C CAAGAAAAA 
GTTGATG AGT CT 

SEQ ID NO. 5204 
STRAIN 18RS21 

TTTTGATATTGAC CAAATTG CAGACAATG CTAT CACT AAAACAG AT AAAA 

CAACAGAAATTATTTCCAACCAGACAACAAGCCAAACTGGGCAA^ 

UTTTTIGAAAAACTAACACCAGCACAAAAGTCTGCTATCTCTGAAAAAAC 

ACCAGCITOGGTAGATACTTTTGTCGGCGATCAAAATGCGCTCCTTGATT 

TTGGACAATC03CAGTAGAAGGCGTTAATACCACTGTTAATCATATCTTG 

TCTGAGCAG AAAAAAATT CAAATT C CT CAAGTTG ATG ATTTACTAAAAAA 

TGCTAATCGCGAACTAAATGGATTTATTGCCAAATATAAAGATGCTACTC 

CGGCAGAATTAGAGAAAAAACCAAACTTGACTCAAAAAT^ 

AG CAAGACCT CG CT ACAGGAATTTTATTTTGACT CACAAAACATCGAGCA 

AAAAATGGATATGATGGCAGCGAATGTTGTCAA^CAAGAAGATACTTTGG 

CAAGAAATATCGTCTCTGCTGAAATGCTCATTGAAGATAATACTAAATCT 

ATTGAAAATTTGGTTGGAGTTATTG CTTTTATTGAAT CGAGT CAAG CCGA 

GG CTG CTAAT CGTG CAAG C CACTT ACAACAAG AAATT CTAGCATT AGATA 

GCCAAACGT C CGAGTAT CAAATTAAAAGT AACCAATTAGCTCGAATGACT 

GAAGTTAT CAAT ACCCT CGAACAG CAACAT C CTGAAT ATGT CAG C CGTCT 

CTACGTTGCATGGGCAACAACACCACAGATGCGAAACT 

CGTCAGATATG C GT CAG AAACTTGG CATGTTACGT CGAAATACCATT CCA 

ACAATGAAACT CTCAATCG CT CAGTTAGG CAT GATGCAACAAT CTGT CAA 

ATCCGCTGTGA.CTGCTGATGCTATTGTCAftCGCTAATAATGCAGCA.TTGC 

AGATG CTGG CTG AAACTAGT AAAGAAG CGATTCCGATGTTAGAGAAGACC 

GCACAAAGCC CCACTGTTTCTACTAAATCTGTCACTG CATTAGCTGAAAG 

CTTAGTGGCTCAAAATAATGGTATTATCGCTGCGATAGACAAAGGACGTA 

AGGAACGTG CC C aATTGGAAT CTGCTGTTATTAAAT CGG CTGAAACAATC 

AATGATTCTGT CAAAATT CGTGATAAAAAAATAGTTGAAG C CTTACT CAA 

CGAAGGT aAATCTAC CCAAG AAAAAGTTGATG AGTCT 

SEQ ID NO. 5205 
STRAIN M732 

AG CG ATACCTTTAATTTTGATATTG AC CAAATTG CAGAC 
AATGCTATCACTAAAACAGATAAAACAACAGAAACTATTTCCAACCAG^ 
AACAAGC CAAACTGGG CAAATTG CCTTTTTTGAAAAACTAACAC CAG CAC 
AAAAGTCTG CTAT CT CTGAAAAAACACCAGCTCTGGTAGATACTTTTGTC 
GGTGACCAAAATG CG CT CCTTG ATTTTGG ACAATCCG CAGTAGAAGG CGT 
TAATACTACTGTTAAT CAT AT (CTTGT CTGAGCAG AAAAAAATT CAAATTC 
CT GAAGTTGATGATTTACTAAAAAATG CTAAT CG CGAACT 
ATTGCCAAATATAAAGATGCTACTCTOGCAGAATTAGAGAAAAAACCAAA 
CTTGACTCAAAAATTATTCAAACAAAG CAAGACCT CGCTACAGGAATTTT 
ATTTTGACTCACAAAACATCGA.GCAAAAAATGG 

GTTGTCAAACAAGAAGATACTTTGGCAAGAAATAT CGT CT CTGCTGAAAT 
GCTGACTGAAGATAATACTAAATCTATTGAAAATCTGGTTG^ 
CTTTTATTG AATCGAGT CAAG C CGAGG CTG CCAAT CGTG CAAG C CACTTA 
CAACAAGAAATTCTAG CATT AGATAGC CAAAC GT C CG AATAT CAAATTAA 
AACTAACCAATTAGCCCGAATGACTGAAGTTATGAATACCCT 
AACATACGGAATATGTCAGC CGTCT CTACGTTG CATGGG CAACAACACCA 
CAGATGCGAAACITGGTCAAAGTATCGTCAGATATGCGT(^ 
TATGTTACGTCXiAAATACCATT CCAACAATGAAACTCTCAAT CG CTCAGT 
TAGG CATGATGCAACAAT CTGTCAAATCCGGTGT CACTG CTGATGCT ATT 
GT CAACGCTAATAATG CAG CATTGCAAATG CTGG CTG AAA CTAGTAAAGA 
AGCGATTCCGATGTTAG AGAAGACCGCACAAAG C C CCACTGTTTCTATTA 
AATCTGT CACTG CATTAGCTGAAAG CTTAGTGG CT CAAAATAAT GGTATT 
AT CGCTG CCATAGA.GAAAGG ACGTAAGGAACX3TG C CCAATTAGAAT CTG C 
TGTTATTAAATCGG CTGAAACAAT CAATGATTCTGTCAAAATTCGTG ATA 
AAAftAATAGTTGAAGCCTTACTCAACGAAGGTAAATCTACCCAAGAAAAA 
G 

SEQ ID NO. 5206 
STRAIN COH1 

CTAAAACAGATAAAACAACAGAAATTATTT C CAAC CAGACAAGAAGCCAA 
ACTGGGCAAATTGCCTTTTTTGAAAAACTAACAC CAG CACAAAAGTCTG C 
T^CTCTC4AAAAAACACCAGCTTTGGTAGATACTTCTGTCGGTGACCAAA 
ATG CG CT CCTTC^TTTTGGACAAT CCG CAGTAGAAGG CGTTAAT ACTACT 
GTTAAT CATATCTTGTCTGAGCAGAAAAAAATTCAAATTC CT CAAGTTGA 
TGATTTACTAAAAAATG CTAAT CX3 CGAACTAAATGGATTTATTGCCAAAT 
ATAAAGATGCTACTCCX^CaG^CTAGAGAAAAAACCAAACTTGATT 
AAATTATTCAAACAAAG CAAGACCTCGCTACAGGAATTTTATTTTGACT C 
ACAAAACAT CGAGCAAAAAATGGATATGATGGCAG CAAATGTTGTCAAAC 
AftGAAGATACTTTGGCAAGAAATATCGTCTCTGCTGAAATGCT 
GATAATACTAAAT CTATTGAAAATTTGGTTGG AGTTACTG CTTTTATTGA 
AT CGAGTCAAGC CGAgG CTG CCAATCGTGCaAG C CACTT ACAACAaGAAA 
TT CT AGC aTT AGATAGCCAAACGT C CG AATAT CAAATTAAAAGT AACCAA 
TTAG C C CGAATGACT GAaGTTATCAaT aC CCT CGAACAG CAACATACGGA 
aT ATGT CAG CCGT CT CT ACGTTGCATGGG CAACAACAC CACAGATG CGAA 
ACTTGGT CAAAGTAT CGT CAGATATGCGT CAGAAACTTGGTATGTTACGT 
CGAAATACCACTCCAACAATGAAACTCTCAATCG CT CAGTTAGG CATGAT 
GGAACAATCTGTCAAATCCGGTGTCACTGCTGATGCTACTGTCAACG 
ATAATGCAG CATTG CAAATG CTGG CTG AAACTAGTAAAGAAG CG ATT CCG 
ATGTTAGAGAAGAC CGCACAAAGC C CCACTGTTT CTATTAAATCTGT CAC 
TGCATTAGCTGAAAGCTTAGTGGCTCAAAATAATGGTATTATCGCTGCCA 
TAGACAAAGGACGT AAGG AACGTG C C CAATTAGAAT CTG CTGTT ATTAAA 
TCGGCrTGAAACAATCAATGATTCTGTCAAAATTCGTGATAAAAAAATAGT 
TGAAGCCTTACTCAaCGAAGGTAAATCTACCCAAGAAAAAGTTGATGAGT 
CT 

SEQ ID NO. 5207 
STRAIN M781 

TTTTGATATTGACCAAATTGCAGACAATGCTATCACTAAAACAGATAAAA 
CAACAGAAATTATTT C CAAC CAGACAACAAG C CAAACTGGG CAAATT GC C 
TTTTTTGAAAAACTAACACCAGCACAAAAGTCTGCTATCTCT 
ACCAGCTTTGGTAGATACTTTTGTCGGTGACCAAAATGCGCTCCTTGATT 
TTGG ACAAT CCG CAGTAGAAGG CGTTAATACT ACTG t TAAT CATAT CTTG 
T CTG AGCAGAAAAAAATT CAAATTCCT CAAGTTGATGATTTACTAAAAAA 
TG CT AATCG CGAACTAAATG GATTT ATTG C CAAAT ATAAAGATG CTACT C 
CXIKSCAGAATTAGAGAAAAAACCAAACITGATTC^^ 

AG CAAGACCTCX3 CTACAGGAATTTTATTTTGACT CACAAAACAT CGAG CA 
AAAAATGGATATGATGG CAG CAAATGTTGT CAAACAAG AAGATACTTTGG 
CAAGAAATATCGTCTCTGCTGAAATGCTCATTGAAGATAATACTAAATCT 
ATTGAAAATTTGGTTGGAGTTATTGCTTTTATTGAATCGAGTCAAGCC 
GG CTG CCAATCGTCCAAGC CACTT ACAACAAGAAATT CT AG CATTAGATA 
GCCAAACGTCCGAATATCAAATTAAAAGTAACCAATTAGCCCGAATGACT 
GAAGTTATCAATACCCT CGAACAG CAACATACGG AATATGTCAG C CGT CT 
CTACGTTGCAIGGGCAACAACACCACAGATG 

CGTCAGATATGCGT CAkGAAACTTCGTATGTTACGTCGAAATAC CATTC CA 
ACAATGAAACT CTC AATCG CTCAGTTAGG CATGATGC AACAATCTGT CAA 
ATCCGGTGT CACTG CTGATGCTATTGT CAACGCTAATAATGCAGCATTG C 
AAATG CTGG CTGAAACTAGTAAAGAAG CGATT CCGATGTT AGAGAAGAC C 
G CACAAAG C C CCACTGTTT CTATTAAATCTGT CACTG CATTAGCTGAAAG 
CTTAGTGGCTCAAAATAATGGTATTATCGCTGCCATAGACAAAGGACGTA 
AGGAACGTG CCCAATTAGAATCTGCTGTTATTAAATCGG CTC 
AATCATTCTGTCAAAATTCGTGATAAAAAAATAGTTCAAGCCTTACTC^\A 
CGAAGGTAAATCTAC CCAAG AAAAAGTTG ATGAGT CT 

SEQ ID NO. 5208
STRAIN CJB110 

TTTTGATATTGACC^AATTGCAGACAATCCTATCACTA 
CAACAGAAATTATTT CCAAC CAGACAACAAG CCAAACTGGG CAAATTG C C 
TTTTTTGAAAAACTAACACCAGCACAAAAGT CTG CTAT CTCTGAAAAAAC 
ACCAGCTTCGGTAGATACTTTTGTCGG CGAT CAAAATG CG CT CCTTGATT 
TTGGACAAT C CG CAGTAGAAGG CGTTAATACCACTGTTAAT CAT ATCTTG 
TCTGAGCAGAAAAAAATTCAAATTCCT(^AAGTTGATGATTTACT 
TG CTAATCG CGAACTAAATGGATTTATTG C CAAATATAAAGATG CTACT C 
CGGCAGAATT AGAGAAAAAAC CAAACTTGATTCAAAAAT^ 
AGCAAGACCTTCCTACAGGAATTTTATTTTGACTCACAAAA 
AAAAATGGATATGATGGCAG CGAATGTTGTCAAACAAGAAGATACTTTGG 
CAAGAAATAT CGTCTCTG CTGAAATGCTCATTGAAGATAATACTAAATCT 
ATTGAAAATTTGGTTGGAGTTATTGCTTTTATTGAAT CGAGT CAAGCCGA 
GG CTGCTAAT CGTG CAAGC CACTTACAACAAGAAATT CTAG CATTAGATA 
GCCAAACGT CCGAGTAT CAAATTAAAAGTAAC CAATTAGCTCGAATGACT 
GAAGTTATCAATAC CCTCGAACAG CAa CATACTG AATATGTCAG CCGTCT 
CTACGTTGCATGGGCaACaACACCACAGATGCGAAACTTGGTCAAAGTAT 
CGTCAGATATGCGTCAGAAACTTGGCATGTTACGTCGAAATACCATTCCA 
ACAATGAAACTCTCAAT CG CTCAGTTAGG CATGATGCAACAAT CTGT CAA 
ATCCGGTGT CACTG CTGATG CTATTGTCAACGCTAATAATGCAGCATTC C 
AGATG CTGGCTg AAACTAGTAAAGAAG CGATT CCGATGTT AGAGAAGACC 
GCACAAAGCC CCACTGTTT CTATTAAATCTGT CACTG CATTAGCTGAAAG 
CTTAGTGG CT CAAAATAATGGT ATT AT CG CTG CCATAGACAAAGGACGTA 
AGGAaCGTG CC CAATTGGAAT CTG CTGTT ATTAAATC GG CTC AAACAAT C 
AATCATTCTGTCAAAATTCGTGATaAAAAAATAGTTGAAGCCTCACTCAA 
CGAAGGTAAATCTAC C CAAG AAAAAGTTGATG AGT CT 

SEQ ID NO. 5209
STRAIN 1169NT

GCAGACAATGCTATGACTAAAACAGATAAAACAACAGAAAT^ 

CCAGACAACAAGCCAAACTGGGCAAATTGCCTTTTT^ 

CAGCACAAAAGTCTGCTATCTCTGAAAAAACACC^ 

TTTGTCGGTGAC CAAAATG CGCT C CTTGATTTTGG ACAAT C CG CAGTAG A 
AGGCGTTAATACCACTGTTAAT CATAT CTTGT CTG AG CAG AAAAAAATT C 
AAATTCCTI^AAGTTGATGATTTACTAAAAAATGCTAATCGCGAACTAAAT 
GGATTTATTGCCAAATATAAAGATGCTACTCCGGCAGAATTAGAGAAAAA 
ACCAAACTTGATCCAAAAATTATTCAAACAAAGCAAGACCT 
AATTTTATTTTGACT CACAAAACAT CGAG CAAAAAATGGATATGATGGCA 
GCAAATGTTGT CAAACAAGAAGATACTTTGG CAAGAAATATCGTCTCTG C 
TCAAATGCTCATTGAAGATAATACTAAATCTATTGAAAATTTGGT^ 
TTATTGCTTTTATTGAATCGAGTCAAGCCGAGGCTGCCAATCGTGCAAGC 
CACTTACAACAAGAAATT CTAGCATTAaA.TAGCCAAACGTCCa^GTAT 
AATTAAAAGTAACCAATTAGCTCXl^TGACTGAAGTT AT CAATAC CCT CG 
Aa CAG CAACATACTG AATATGTCAG C CGT CTCTACGTTG CATGGG CAACA 
aCACCACAGATC TOAAACTTGGTCAAAGT AT CGT CAG ATATG CGT CAAAA 
ACTTGG CATGTTACGT CGAAATAC CATTC CAACAATG AAACT CT CAAT CG 
CTCAGTTAGG CATGATG CAACAATCTCTCAAAT CCC4GTGT CACTG CTGAT 
G CTATTCTCAACG CTAATAATG CAG CATTC CAGATG CTGGCTGAAACTAG 
TAAAGAAGCGATTCCGATGTTAGAGAAGACCGCACAAAGCCCCACTGTTT 
CTATT AAAT CTGT CACTG CATT AG CTGAAAG CTTAGTGG CTCAAAAT AAT 
GGTATTATCGCTGCCATAGACAAAGGACGTAAGGAACGTGCCCAATTAGA 
ATCTGCTGTTATTAAATCGGCTGAAACAATCAATGATTCTGTCAAAATT C 
GTGATAAAAAAATAGTTGAAGCCTTACTCAACGAAGGTaAATCTACCCAA 
GAAAAAGTTGATGAGTCT 

SEQ ID NO. 5210 
STRAIN JM9130013

AGCGATACCTTTAATTTTGATATTGACCAAATTGCAGAC 
AATG CTAT CACTAAAACAG ATAAAACAACAGAAATTATTT CCAACCAGAC 
AACAAGCCAAACTGGGCAAATTGCCTTTTTTGAAAAACTAACAC CAGCAC 
AAAAGTCrGCTATCTCTGAAAAAACACCAGCTTTGGTAGATACTTTTGTC 
GGTGACCAAAATGCGCrCCTTGATTTTGGACAATCCGCAGTAGAAGGCGT 
TAATACCACTGTTAATCATAT CTTGTCTGAG CAG AAAAAAATTC AAATT C 
CT CAAGT TGATGATTTACTAAAAAATG CTAAT CG CGAACTAAATGGATTT 
ATTGCCAAATATAAAGATGCTACTCCGGCAGAATTAGAGAAAAAACCAAA 
CTTGATT CAAAAATTATT CAA^CAAAGCAAGAC CT CG CTACAGGAATTTT 
ATTTTGACTCA.CAAAACAT CGAG CAAAAAATG G AT ATGATGG CAG CG AAT 
GTTGT CAAACAAGAAGATACTTTGG CAAGAAATAT CGTCT CTGCTGAAAT 
GCTCATTGAAGATAATACTAAATCTATTGAAAATTTGGTTGGAG 
CTTTTATTG AATcGAGTCAAG CCGAGG CTG C CAATCGTG CAAGC CACTTA 
CAACAAGAAATT CT AG CATT AG ATAG C CAAACGT C CG AGTAT CAAAT tAA 
AAGTaACCAATTAGCrCGAATGACrGAAGTTATCAATACCCTCGAACAGC 
AACATACTGAATATGTCAGCCGTCTCTACGI^GCATGGGCAACAACACCA 
CAGATGCGAAACnTGGTCAAAGTATCGTC^GATATGCGTCAAAAACrrGG 
CATGTTACGTCGAAATACCATTCC^a^TGAAACTCTCAATCGCTCAGT 
TAGG CATGATG CAACAATCTGT CAAATCCGGTGT CACTG CTGATG CTATT 
GT CAACG CTAAT AATG CAG CATTG CAGATG CTGG CTGAAACTAGTAAAGA 
AGCGATTCCGATGTTAGAGAAGACCGCACAAAGCCCCACTGTTTCTATTA 
AATCTGT CACTG CATTAG CTGAAAG CTTAGTGGCTCAAAAT AATGGTATT 
AT CG CTG CCATAGACAAAGG aCGTAAGGAACGTGC C CAATTAGAATCTG C 
TGTT ATTAAAT CGG CTGAA^CAAT CAATGATTCTGTCAAAATTCGTGAT A 
AAAAAATAGTTG AAG CCTT ACT CAACGAAGGT aAAT CTACCCAAGAAAAA 
GTTGATGAGT CT 

SEQ ID NO. 5211 
STRAIN 2603 

agcgatacctttaattttgatattgaccaaattgcagacaatgctatcac 
taaaacagataaaacaacagaaattatttccaaccagacaacaagccaaa 
ctgggcaaattgccttttttgaaaaactaacaccagcacaaaagtctgct 
atctctgaaaaaacaccagctttggtagatacttttgtcggcgatcaaaa 
tgcgctccttgattttggacaatccgcagtagaag'gcgttaataccactg 
ttaatcatatcttgtctgagcagaaaaaaattcaaattcctcaagttgat 
gatttactaaaaaatgctaatcgcgaactaaatggatttattgccaaata 
taaagatgctactccggcagaattagagaaaaaaccaaacttgattcaaa 
aattattcaaacaaagcaagacctcgctacaggaattttattttgactca 
caaaacatcgagcaaaaaatggatatgatggcagcgaatgttgtcaaaca 
agaagat ac 1 1 tggcaagaaat at cgt ct c tgctgaaatgct cat tgaag 
ataatactaaatctattgaaaatttggttggagttattgcttttattgaa 
tcgagtcaagccgaggctgctaatcgtgcaagccacttacaacaagaaat 
tctagcattagatagccaaacgtccgagtatcaaattaaaagtaaccaat 
tagctcgaatgactgaagttatcaataccctcgaacagcaacatcctgaa 
tatgtcagccgtctctacgttgcatgggcaacaacaccacagatgcgaaa 
cttggtcaaagtatcgtcagatatgcgtcagaaacttggcatgttacgtc 
gaaataccattccaacaatgaaactctcaatcgctcagttaggcatgatg 
caacaatctgtcaaatccggtgtcactgctgatgctattgtcaacgctaa 
taatgcagcattgcagatgctggctgaaactagtaaagaagcgattccga 
tgttagagaagaccgcacaaagccccactgtttctattaaatctgtcact 
gcattagctgaaagcttagtggctcaaaataatggtattatcgctgccat 
agacaaaggacgtaaggaacgtgcccaattggaatctgctgttattaaat 
cggctgaaacaatcaatgattctgtcaaaattcgtgataaaaaaatagtt 
gaagccttactcaacgaaggtaaatctacccaagaaaaagttgatgagtc 
t 
TTTTGA 

AGCGATACCT TTAATTTTGA 

. TTTTGA 

TTTTGA 

AGCGATACCT TTAATTTTGA 
AGCGATACCT TTAATTTTGA 
AGCGATACCT TTAATTTTGA 
AGCGATACCT TTAATTTTGA 



TATTGACCAA 
TATTGACCAA 
TATTGACCAA 
TATTGACCAA 
TATTGACCAA 
TATTGACCAA 
TATTGACCAA 
TATTGACCAA 



AGCGATACCT TTAATTTTGA TATTGACCAA 
********** ********** ********** 



ATTGCAGACA 

ATTGCAGACA 

ATTGCAGACA 

ATTGCAGACA 

ATTGCAGACA 

ATTGCAGACA 

ATTGCAGACA 

ATTGCAGACA 

GCAGACA 

ATTGCAGACA 
********** 



50 

C 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 

ATGCTATCAC 
********** 
msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



51 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 

TAAAACAGAT 
********** 



AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 

AAAACAACAG 
********** 



AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 

AAATTATTTC 
********** 



CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 
CAACCAGACA 



100 

ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 
ACAaGCCAAA 



********** ********** 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus 



101 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 

CTGGGCAAAT 
********** 

151 

wTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 
aTCTCTGAAA 

aTCTCTGAAA 
_********* 



TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 

TGCCTTTTTT 
********** 



AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 

AAACACCAGC 
********** 



GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 

GAAAAACTAA 
********** 



TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 

TTTGGTAGAT 
********** 



CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 

CACCAGCACA 
********** 



ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 

ACTTTTGTCG 
********** 



150 

AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 
AAAGTCTGCT 

AAAGTCTGCT 
********** 

200 

GtGAcCAAAA 

GtGAcCAAAA 

GcGAtCAAAA 

GcGAtCAAAA 

GcGAtCAAAA 

GcGAtCAAAA 

GtGAcCAAAA 

GtGAcCAAAA 

GtGAcCAAAA 

GtGAcCAAAA 

GtGAcCAAAA 
*—**—***** 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 

GATTTTGGAC 
********** 



CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 
CTTGTCTGAG 



AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 

AATCCGCAGT 
********** 



AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 
AGAAGGCGTT 

AGAAGGCGTT 
********** 



msa13607.2{201_COH1}



250 

AATACtACTG 
AATACtACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 
AATACcACTG 

AATACtACTG 
*****_**** 

300 

TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 
TCAAGTTGAT 

********** ********** ********** ********** ********** 

301 350 
GATTTACTAA AAAATGCTAA TCGCGAACTA AATGGATTTA TTGCCAAATA 



201 

TGCGCTCCTT
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 
TGCGCTCCTT 

TGCGCTCCTT 
********** 

251 

TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 
TTAATCATAT 



CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 
CAGAAAAAAA 



TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 
TTCAAATTCC 



PCT/US2003/026827 



Table 52: Comparative Sequences relating to SAG 1823 



msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}
Consensus 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}



AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 

AAAATGCTAA 
********** 



GATTTACTAA 
GAT TTACTAA 
GATTTACTAA 
GATTTACTAA 
GATTTACTAA 
GATTTACTAA 
GATTTACTAA 
GATTTACTAA 
GATTTACTAA 
GATTTACTAA 
********** 

351 

TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
TAAAGATGCT 
********** ********** 



ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 
ACTCCGGCAG 



TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 

TCGCGAACTA 
********** 



AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 

AATTAGAGAA 
********** 



AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 
AATGGATTTA 



TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 
TTGCCAAATA 



********** ********** 



401 

AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 
AATTATTCAA 



ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCgCTAC 
ACCTCaCTAC 
ACCTCgCTAC 
********** ********** *****-**** 



ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 
ACAAAGCAAG 



451 

CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 
CAAAACATCG 



AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 
AGCAAAAAAT 



********** ********** 



GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 
GGATATGATG 

GGATATGATG 
********** 



400 

TTGATt CAAA 
TTGATtCAAA 
TTGATt CAAA 
TTGATtCAAA 
TTGATtCAAA 
TTGATtCAAA 
TTGATtCAAA 
TTGATtCAAA 
TTGATtCAAA 
TTGATc CAAA 
TTGATtCAAA 
*****_**** 

450 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 

TTTTGACTCA 
********** 

500 

TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
TTGTCAAACA 
*****-**** ********** 



AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 

AAAACCAAAC 
********** 



AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 

AGGAATTTTA 
********** 



GCAGCaAATG 
GCAGCaAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCgAATG 
GCAGCaAATG 
GCAGCaAATG 



501 

AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 
AGAAGATACT 



TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 
TTGGCAAGAA 



********** ********** 



551 

ATAATACTAA 
ATAATACTAA 
ATAATACTAA 
ATAATACTAA 
ATAATACTAA 



AT CTATTGAA 
AT CTATTGAA 
ATCTATTGAA 
AT CTATTGAA 
ATCTATTGAA 



ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 
ATATCGTCTC 

ATATCGTCTC 
********** 



AATTTGGTTG 
AATTTGGTTG 
AATTTGGTTG 
AATTTGGTTG 
AATTTGGTTG 



TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 
TGCTGAAATG 



550 

CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 
CTCATTGAAG 



********** ********** 



GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 
GAGTTAt TGC 



600 

TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 
TTTTATTGAA 






msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}



ATAATACTAA 

ATAATACTAA 

ATAATACTAA 

ATAATACTAA 

ATAATACTAA 

ATAATACTAA 
********** 

601 

TCGAGTCAAG 

TCGAGT CAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 

TCGAGTCAAG 
********** 

651 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 

TCTAGCATTA 
********** 

701 

TAGCcCGAAT 

TAGCcCGAAT 

TAGCt CGAAT 

TAGCtCGAAT 

TAGCt CGAAT 

TAGCt CGAAT 

TAGCtCGAAT 

TAGCtCGAAT 

TAGCtCGAAT 

TAGCtCGAAT 

TAGCcCGAAT 
****_***** 

751 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 

TATGTCAGCC 
********** 

801 

CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTPAAA 
CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 



ATCTATTGAA 

ATCTATTGAA 

ATCTATTGAA 

ATCTATTGAA 

ATCTATTGAA 

ATCTATTGAA 
********** 



CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 

CCGAGGCTGC 
********** 



AATTTGGTTG 

AATTTGGTTG 

AATTTGGTTG 

AATTTGGTTG 

AATTTGGTTG 

AATTTGGTTG 
********** 



cAATCGTGCA 

cAATCGTGCA 

tAATCGTGCA 

tAATCGTGCA 

tAATCGTGCA 

tAATCGTGCA 

CAATCGTGCA 

CAATCGTGCA 

cAATCGTGCA 

cAATCGTGCA 

cAATCGTGCA 
_********* 



GATAGCCAAA 

GATAGC CAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 

GATAGCCAAA 
********** 



GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 

GACTGAAGTT 
********** 



GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 

GTCTCTACGT 
********** 



GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 



CGTCCGAaTA 

CGTCCGAaTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAgTA 

CGTCCGAaTA 
*******_** 



GAGTTAt TGC 

GAGTTAwTGC 

GAGTTAt TGC 

GAGTTAt TGC 

GAGTTAt TGC 

GAGTTAt TGC 
******_*** 



AGCCACTTAC 
AGC CACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 
AGCCACTTAC 


********** 



TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 

TCAAATTAAA 
********** 



ATCAATAC CC 

ATCAATACCC 

ATCAATAC CC 

ATCAATACCC 

ATCAATACCC 

ATCAATACCC 

ATCAATACCC 

ATCAATACCC 

ATCAATACCC 

ATCAATACCC 

ATCAATACCC 
********** 



TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 

TGCATGGGCA 
********** 



TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 

TCGAACAGCA 
********** 



ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 

ACAACACCAC 
********** 



ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 
ATATGCGTCA 



gAAACTTGGt 
gAAACTTGGt 
gAAACTTGGc 
gAAACTTGGc 
gAAACTTGGc 
gAAACTTGGc 
aAAACTTGGc 



TTTTATTGAA 

TTTTATTGAA 

TTTTATTGAA 

TTTTATTGAA 

TTTTATTGAA 

TTTTATTGAA 
********** 

650 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 

AACAAGAAAT 
********** 

700 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 

AGTAACCAAT 
********** 

750 

ACATaCgGAA 

ACATaCgGAA 

ACATaCtGAA 

ACATaCtGAA 

ACATcCtGAA 

ACATcCtGAA 

ACATaCtGAA 

ACATaCtGAA 

ACATaCtGAA 

ACATaCtGAA 

ACATaCgGAA 
****-*-*** 

800 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 

AGATGCGAAA 
********** 

850 

ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 
ATGTTACGTC 






msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 
CTTGGTCAAA 



GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 
GTATCGTCAG 



********** ********** 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}



851 

GAAATA C CAT 

GAAATACCAT 

GAAATA C CAT 

GAAATACCAT 

GAAATACCAT 

'GAAATACCAT 

GAAATACCAT 

GAAATACCAT 

GAAATACCAT 

GAAATACCAT 

GAAATACCAT 
********** 

901 

CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 
CAACAATCTG 



TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 

TCCAACAATG 
********** 



TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 
TCAAATCCGG 



ATATGCGTCA 

ATATGCGTCA 

ATATGCGTCA 

ATATGCGTCA 
********** 



AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 

AAACTCTCAA 
********** 



TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 
TGTCACTGCT 



aAAACTTGGc 

aAAACTTGGc 

aAAACTTGGc 

gAAACTTGGt 
_********_ 



TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 

TCGCTCAGTT 
********** 



GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 
GATGCTATTG 


GATGCTATTG 



********** ********** ********** ********** 



951 

TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 
TAATGCAGCA 



TTGCAaATGC 
TTGCAaATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAgATGC 
TTGCAaATGC 



TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 
TGGCTGAAAC 



********** *****_**** ********** 



1001 

TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
TGTTAGAGAA 
********** 

1051 

GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GCATTAgCTG 
GGATTAtCTG 
GCATTAgCTG 
GCATTAgCTG 



GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
GACCGCACAA 
********** 



AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 
AAAGCTTAGT 



AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 

AGCCCCACTG 
********** 



GGCTCAAAAT 
GGCT CAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 
GGCTCAAAAT 



TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 

TAGTAAAGAA 
********** 



TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 

TTTCTATTAA 
********** 



AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 
AATGGTATTA 



ATGTTACGTC 

ATGTTACGTC 

ATGTTACGTC 

ATGTTACGTC 
********** 

900 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 

AGGCATGATG 
********** 

950 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 

TCAACGCTAA 
********** 

1000 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 

GCGATTCCGA 
********** 

1050 

ATCTGTCACT 

AT CTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 

ATCTGTCACT 
********** 

1100 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 
TCGCTGCCAT 



msa13607.2{201_M732}

Consensus



GCATTAgCTG AAAGCTTAGT GGCTCAAAAT AATGGTATTA TCGCTGCCAT 
******_*** ********** ********** ********** ********** 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



1101 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 

AGACAAAGGA 
********** 

1151 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 

CGGCTGAAAC 
********** 

1201 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 

GAAGCCTTAC 
********** 



CGTAAgGAAC 

CGTAAgGAAC 

CGTAAgGAAC 

CGTAAgGAAC 

CGTAAgGAAC 

CGTAAgGAAC 

CGTAAaGAAC 

CGTAAaGAAC 

CGTAAgGAAC 

CGTAAgGAAC 

CGTAAgGAAC 
*****-**** 



AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 

AATCAATGAT 
********** 



TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 

TCAACGAAGG 
********** 



GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 

GTGCCCAATT 
********** 



TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 

TCTGTCAAAA 
********** 



TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 

TAAATCTACC 
********** 



aGAATCTGCT 

aGAATCTGCT 

gGAATCTGCT 

gGAATCTGCT 

gGAATCTGCT 

gGAATCTGCT 

aGAATCTGCT 

aGAATCTGCT 

aGAATCTGCT 

aGAATCTGCT 

aGAATCTGCT 
—********* 



TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 

TTCGTGATAA 
********** 



CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 

CAAGAAAAAG 
********** 



1150 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 

GTTATTAAAT 
********** 

1200 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 

AAAAATAGTT 
********** 

1250 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 
ttgatgagtc 



msa13607.2{201_COH1}
msa13607.2{201_M781}
msa13607.2{201_090}
msa13607.2{201_CJB110}
msa13607.2{201_18RS21}
msa13607.2{201_2603}
msa13607.2{201_A909}
msa13607.2{201_H36B}
msa13607.2{201_JM9130013}
msa13607.2{201_1169NT}
msa13607.2{201_M732}

Consensus



1251 

t 

t 

t 

t 

t 

t 

t 

t 

t 

t 



SEQ ID NO. 5212 

STRAIN 090 frame: 1

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD
TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDA
TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEM
LIEDNTKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEV
INTLEQQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNT

QQSVKSGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQN
NGIIAAIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5213

STRAIN A909 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD
TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDA
TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEM
LIEDNTKSIENLVGVXAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEV
INTLEQQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMM
QQSVKSGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQN
NGIIAAIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5214 

STRAIN H36B frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD
TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDA
TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEM
LIEDNTKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEV
INTLEQQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMM
QQSVKSGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALSESLVAQN
NGIIAAIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5215 

STRAIN 18RS21 frame: 2 

FDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVDTFVGD
QNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDATPAEL
EKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDN
TKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEVINTLE
QQHPEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMMQQSVK
SGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQNNGIIA
AIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5216 

STRAIN M732 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD
TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDA
TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEM
LIEDNTKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEV
INTLEQQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMM
QQSVKSGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQN
NGIIAAIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEK

SEQ ID NO. 5217 

STRAIN COH1 frame: 3 

KTDKTTEIISNQTTCQTGQIAFFEKLTPAQKSAXSEKTPALVDTFVGDQNALLDFGQSAV
EGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDATPAELEKKPNLIQKLFK
QSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDNTKSIENLVGVIA
FIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEVINTLEQQHTEYVSRLYV
AWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMMQQSVKSGVTADAIVNAN
NAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQNNGIIAAIDKGRKERAQL
ESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5218 

STRAIN COH1 frame: 3 

KTDKTTEIISNQTTCQTGQIAFFEKLTPAQKSAXSEKTPALVDTFVGDQNALLDFGQSAV
EGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDATPAELEKKPNLIQKLFK
QSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDNTKSIENLVGVIA
FIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEVINTLEQQHTEYVSRLYV
AWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMMQQSVKSGVTADAIVNAN
NAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQNNGIIAAIDKGRKERAQL
ESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5219 

STRAIN M781 frame: 2 

FDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVDTFVGD
QNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDATPAEL
EKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDN
TKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEVINTLE
QQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMMQQSVK
SGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQNNGIIA
AIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5220 
STRAIN CJB110 frame: 2 

FDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVDTFVGD
QNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDATPAEL
EKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDN
TKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEVINTLE
QQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMMQQSVK
SGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQNNGIIA
AIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5221 

STRAIN 1169NT frame: 1 

ADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVDTFVGDQNALLD
FGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDATPAELEKKPNL
IQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEMLIEDNTKSIEN
LVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEVINTLEQQHTEY
VSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMMQQSVKSGVTAD
AIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQNNGIIAAIDKGR
KERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5222 

STRAIN JM9130013 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD
TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDA
TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEM
LIEDNTKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEV
INTLEQQHTEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMM
QQSVKSGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQN
NGIIAAIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

SEQ ID NO. 5223 

STRAIN 2603 frame: 1 

SDTFNFDIDQIADNAITKTDKTTEIISNQTTSQTGQIAFFEKLTPAQKSAISEKTPALVD
TFVGDQNALLDFGQSAVEGVNTTVNHILSEQKKIQIPQVDDLLKNANRELNGFIAKYKDA
TPAELEKKPNLIQKLFKQSKTSLQEFYFDSQNIEQKMDMMAANVVKQEDTLARNIVSAEM
LIEDNTKSIENLVGVIAFIESSQAEAANRASHLQQEILALDSQTSEYQIKSNQLARMTEV
INTLEQQHPEYVSRLYVAWATTPQMRNLVKVSSDMRQKLGMLRRNTIPTMKLSIAQLGMM
QQSVKSGVTADAIVNANNAALQMLAETSKEAIPMLEKTAQSPTVSIKSVTALAESLVAQN
NGIIAAIDKGRKERAQLESAVIKSAETINDSVKIRDKKIVEALLNEGKSTQEKVDES

PRETTY of: /biotmp/msa28369.2{*} April 22, 2002 04:27



msa28369.2{201_090}
msa28369.2{201_1169NT}
msa28369.2{201_A909}
msa28369.2{201_JM9130013}
msa28369.2{201_COH1}
msa28369.2{201_CJB110}
msa28369.2{201_M781}
msa28369.2{201_2603}
msa28369.2{201_H36B}
msa28369.2{201_18RS21}
msa28369.2{201_M732}

Consensus



1 

sdtfnfdidq iadnaitKTD 

-adnaitKTD 

sdtfnfdidq iadnaitKTD 
sdtfnfdidq iadnaitKTD 

KTD 

fdidq iadnaitKTD 

fdidq iadnaitKTD 

sdtfnfdidq iadnaitKTD 
sdtfnfdidq iadnaitKTD 

fdidq iadnaitKTD 

sdtfnfdidq iadnaitKTD 

M *** 



KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 
KTTEIISNQT 



TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TcQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 
TsQTGQIAFF 



50 

EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 
EKLTPAQKSA 



********** *_******** ********** 



msa28369.2{201_090}
msa28369.2{201_1169NT}
msa28369.2{201_A909}
msa28369.2{201_JM9130013}
msa28369.2{201_COH1}
msa28369.2{201_CJB110}
msa28369.2{201_M781}
msa28369.2{201_2603}
msa28369.2{201_H36B}
msa28369.2{201_18RS21}
msa28369.2{201_M732}

Consensus



51 

iSEKTPALVD 

i SEKTPALVD 

iSEKTPALVD 

iSEKTPALVD 

XSEKTPALVD 

iSEKTPALVD 

iSEKTPALVD 

iSEKTPALVD 

iSEKTPALVD 

iSEKTPALVD 

iSEKTPALVD 
_********* 



TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 
TFVGDQNALL 



DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 
DFGQSAVEGV 



NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 
NTTVNHILSE 



100 

QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 
QKKIQIPQVD 



********** ********** ********** ********** 



msa28369.2{201_090}
msa28369.2{201_1169NT}
msa28369.2{201_A909}
msa28369.2{201_JM9130013}
msa28369.2{201_COH1}
msa28369.2{201_CJB110}
msa28369.2{201_M781}
msa28369.2{201_2603}
msa28369.2{201_H36B}
msa28369.2{201_18RS21}
msa28369.2{201_M732}

Consensus



101 

DLL KNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 

DLLKNANREL 
********** 



NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 

NGF I AKYKDA 
********** 



TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 
TPAELEKKPN 



LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 
LIQKLFKQSK 



150 

TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 
TSLQEFYFDS 



********** ********** ********** 



msa28369.2{201_090}
msa28369.2{201_1169NT}
msa28369.2{201_A909}
msa28369.2{201_JM9130013}
msa28369.2{201_COH1}
msa28369.2{201_CJB110}
msa28369.2{201_M781}
msa28369.2{201_2603}
msa28369.2{201_H36B}



151 

QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 
QNIEQKMDMM 



AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 
AANWKQEDT 



LARN I VSAEM 
LARNIVSAEM 
LARN I VSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 
LARNIVSAEM 



LI EDNTKS IE 
LI EDNTKS IE 
LI EDNTKS IE 
LI EDNTKS IE 
LI EDNTKS IE 
LIEDNTKSIE 
LI EDNTKS IE 
LIEDNTKSIE 
LIEDNTKSIE 



200 

NLVGVi AF I E 
NLVGV iAFIE 
NLVGVxAF I E 
NLVGV iAFIE 
NLVGV iAFIE 
NLVGViAFIE 
NLVGV i AF I E 
NLVGViAFIE 
NLVGViAFIE 



msa28369.2{201_18RS21}
msa28369.2{201_M732}

Consensus



QNIEQKMDMM AANWKQEDT LARNIVSAEM LIEDNTKSIE NLVGViAFIE 
QNIEQKMDMM AANWKQEDT LARNIVSAEM LIEDNTKSIE NLVGViAFIE 
********** ********** ********** ********** *****_**** 



msa28369.2{201_090}
msa28369.2{201_1169NT}
msa28369.2{201_A909}
msa28369.2{201_JM9130013}
msa28369.2{201_COH1}
msa28369.2{201_CJB110}
msa28369.2{201_M781}
msa28369.2{201_2603}
msa28369.2{201_H36B}
msa28369.2{201_18RS21}
msa28369.2{201_M732}

Consensus



201 

SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 
SSQAEAANRA 



250 

INTLEQQHtE 
INTLEQQHtE 
INTLEQQHtE 
INTLEQQHtE 
INTLEQQHtE 
INTLEQQHtE 
INTLEQQHtE 
INTLEQQHpE 
INTLEQQHtE 
INTLEQQHpE 
INTLEQQHtE 

********** ********** ********** ********** ********_* 



SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 
SHLQQEILAL 



DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 
DSQTSEYQIK 



SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 
SNQLARMTEV 



msa28369.2{201_090}
msa28369.2{201_1169NT}
msa28369.2{201_A909}
msa28369.2{201_JM9130013}
msa28369.2{201_COH1}
msa28369.2{201_CJB110}
msa28369.2{201_M781}
msa28369.2{201_2603}
msa28369.2{201_H36B}
msa28369.2{201_18RS21}
msa28369.2{201_M732}

Consensus



251 

YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 
YVSRLYVAWA 



TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 
TTPQMRNLVK 



VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 
VSSDMRQKLG 



********** ********** ********** 



MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 

MLRRNTIPTM 
********** 



300 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 

KLSIAQLGMM 
********** 



                           301                                                350
msa28369.2{201_090}        QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_1169NT}     QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_A909}       QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_JM9130013}  QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_COH1}       QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_CJB110}     QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_M781}       QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_2603}       QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_H36B}       QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_18RS21}     QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
msa28369.2{201_M732}       QQSVKSGVTA DAIVNANNAA LQMLAETSKE AIPMLEKTAQ SPTVSIKSVT
Consensus                  ********** ********** ********** ********** **********



                           351                                                400
msa28369.2{201_090}        ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_1169NT}     ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_A909}       ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_JM9130013}  ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_COH1}       ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_CJB110}     ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_M781}       ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_2603}       ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_H36B}       ALsESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_18RS21}     ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
msa28369.2{201_M732}       ALaESLVAQN NGIIAAIDKG RKERAQLESA VIKSAETIND SVKIRDKKIV
Consensus                  **_******* ********** ********** ********** **********



                           401          417
msa28369.2{201_090}        EALLNEGKST QEKvdes
msa28369.2{201_1169NT}     EALLNEGKST QEKvdes
msa28369.2{201_A909}       EALLNEGKST QEKvdes
msa28369.2{201_JM9130013}  EALLNEGKST QEKvdes
msa28369.2{201_COH1}       EALLNEGKST QEKvdes
msa28369.2{201_CJB110}     EALLNEGKST QEKvdes
msa28369.2{201_M781}       EALLNEGKST QEKvdes
msa28369.2{201_2603}       EALLNEGKST QEKvdes
msa28369.2{201_H36B}       EALLNEGKST QEKvdes
msa28369.2{201_18RS21}     EALLNEGKST QEKvdes
msa28369.2{201_M732}       EALLNEGKST QEK
Consensus                  ********** ***

Table 53: Comparative Sequences relating to SAG 0755



SEQ ID NO. 5301 
STRAIN 2603 

acaaatactttgaaaaaagaattagttgaagctaaaaagacaattccatc 
cgtaaaagcttcaaaagtaccgcaaaaatcaacatcatcgaaagataaag 
agtttgttcttaaaccgattatcgatgtctctggttggcaacttcctaag 
gagattgattacgatacgctttcaaaaaatatttcaggtgttgttattcg 
tgtctttggtggatcaaagatatctaagactaataacgctgcttatacaa 
ctggaatcgataaatcgtttaagacccatatcaaagaatttcaaaagcga 
aatatcccagtagctgtctacagttatgcacttggttcaagtgttaaaga 
aatgaaagaagaggctcagatattttataagaatgcagctccttacaaac 
caactttttattggattgacgtagaagaggagacaatgtctaacatgaat 
aaaggtgtccaagcattccgaaaagaattaaaaagacttggtgctaaaaa
tgttggtatctacattggtacttactttatgactgagcaaggcatctctg 
taaaaggatttgacgctgtttggattccaacttatggtagcgattctgga 
tactatgaagcggctccgcaaactgaacttaaatacgatttacaccaata 
cacctctcaaggttatctaccaggawtcaatcaaccgcttgatttaaatc 
aaattgcagttaataaagacaagaagaaaacttatgagaaactttttgga 
aaagtaaaagag 

SEQ ID NO. 5302
STRAIN 090

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATTGGTACTTACTTTATGACTGAGCAAGGCATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5303 
STRAIN A909 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATTGGTACTTACTTTATGACTGAGCAAGGCATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5304 
STRAIN H36B 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATTGGTACTTACTTTATGACTGAGCAAGGCATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5305 
STRAIN 18RS21 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATTGGTACTTACTTTATGACTGAGCAAGGCATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5306 
STRAIN M732 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TATCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAGTTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATCGGTACTTACTTTATGACTGAGCAAGGTATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCAGCTCCACAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5307 
STRAIN COH1 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TATCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAGTTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATCGGTACTTACTTTATGACTGAGCAAGGTATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCAGCTCCACAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5308 
STRAIN M781 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TATCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAGTTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATCGGTACTTACTTTATGACTGAGCAAGGTATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCAGCTCCACAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

SEQ ID NO. 5309 
STRAIN CJB110 

AAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATCCG
TAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAGAG
TTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAGGA
GATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCGTG
TCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAACT
GGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGAAA
TATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGAAA
TGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAACCA
ACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAATAA
AGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGTGCTAAAAATG
TTGGTATCTACATTGGTACTTACTTTATGACTGAGCAAGGCATCTCTGTA
AAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGATA






WO 2004/018646 



PCT/US2003/026827 






CTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATACA 
CCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATCAA
ATTACAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGAAA 
AGTAAAAGAG 



SEQ ID NO. 5310 
STRAIN 1169NT 

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGCGCTAAAAA
TGTTGGTATCTACATCGGTACTTACTTTATGACTGAGCAAGGTATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCAGCTCCGCAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG



SEQ ID NO. 5311 
STRAIN JM9130013

ACAAATACTTTGAAAAAAGAATTAGTTGAAGCTAAAAAGACAATTCCATC
CGTAAAAGCTTCAAAAGTACCGCAAAAATCAACATCATCGAAAGATAAAG
AGTTTGTTCTTAAACCGATTATCGATGTCTCTGGTTGGCAACTTCCTAAG
GAGATTGATTACGATACGCTTTCAAAAAATATTTCAGGTGTTGTTATTCG
TGTCTTTGGTGGATCAAAGATATCTAAGACTAATAACGCTGCTTATACAA
CTGGAATCGATAAATCGTTTAAGACCCATATCAAAGAATTTCAAAAGCGA
AATATCCCAGTAGCTGTCTACAGTTATGCACTTGGTTCAAGTGTTAAAGA
AATGAAAGAAGAGGCTCAGATATTTTATAAGAATGCAGCTCCTTACAAAC
CAACTTTTTATTGGATTGACGTAGAAGAGGAGACAATGTCTAACATGAAT
AAAGGTGTCCAAGCATTCCGAAAAGAATTAAAAAGACTTGGTGCTAAAAA
TGTTGGTATCTACATTGGTACTTACTTTATGACTGAGCAAGGCATCTCTG
TAAAAGGATTTGACGCTGTTTGGATTCCAACTTATGGTAGCGATTCTGGA
TACTATGAAGCGGCTCCGCAAACTGAACTTAAATACGATTTACACCAATA
CACCTCTCAAGGTTATCTACCAGGATTCAATCAACCGCTTGATTTAAATC
AAATTGCAGTTAATAAAGACAAGAAGAAAACTTATGAGAAACTTTTTGGA
AAAGTAAAAGAG

PRETTY of: /biotmp/msa21441.2{*} January 20, 2003 03:46



                           1                                                   50
msa21441.2{206_090}        acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_18RS21}     acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_2603}       acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_A909}       acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_H36B}       acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_JM9130013}  acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_CJB110}     --AAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_COH1}       acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_M732}       acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_M781}       acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
msa21441.2{206_1169NT}     acAAATACTT TGAAAAAAGA ATTAGTTGAA GCTAAAAAGA CAATTCCATC
Consensus                  __******** ********** ********** ********** **********



                           51                                                 100
msa21441.2{206_090}        CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_18RS21}     CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_2603}       CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_A909}       CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_H36B}       CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_JM9130013}  CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_CJB110}     CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_COH1}       CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_M732}       CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_M781}       CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
msa21441.2{206_1169NT}     CGTAAAAGCT TCAAAAGTAC CGCAAAAATC AACATCATCG AAAGATAAAG
Consensus                  ********** ********** ********** ********** **********



                           101                                                150
msa21441.2{206_090}        AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_18RS21}     AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_2603}       AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_A909}       AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_H36B}       AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_JM9130013}  AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_CJB110}     AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_COH1}       AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_M732}       AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_M781}       AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
msa21441.2{206_1169NT}     AGTTTGTTCT TAAACCGATT ATCGATGTCT CTGGTTGGCA ACTTCCTAAG
Consensus                  ********** ********** ********** ********** **********



                           151                                                200
msa21441.2{206_090}        GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_18RS21}     GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_2603}       GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_A909}       GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_H36B}       GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_JM9130013}  GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_CJB110}     GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_COH1}       GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_M732}       GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_M781}       GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
msa21441.2{206_1169NT}     GAGATTGATT ACGATACGCT TTCAAAAAAT ATTTCAGGTG TTGTTATTCG
Consensus                  ********** ********** ********** ********** **********



                           201                                                250
msa21441.2{206_090}        TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_18RS21}     TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_2603}       TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_A909}       TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_H36B}       TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_JM9130013}  TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_CJB110}     TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_COH1}       TaTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_M732}       TaTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_M781}       TaTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
msa21441.2{206_1169NT}     TgTCTTTGGT GGATCAAAGA TATCTAAGAC TAATAACGCT GCTTATACAA
Consensus                  *_******** ********** ********** ********** **********



                           251                                                300
msa21441.2{206_090}        CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_18RS21}     CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_2603}       CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_A909}       CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_H36B}       CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_JM9130013}  CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_CJB110}     CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_COH1}       CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_M732}       CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_M781}       CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
msa21441.2{206_1169NT}     CTGGAATCGA TAAATCGTTT AAGACCCATA TCAAAGAATT TCAAAAGCGA
Consensus                  ********** ********** ********** ********** **********



                           301                                                350
msa21441.2{206_090}        AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_18RS21}     AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_2603}       AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_A909}       AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_H36B}       AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_JM9130013}  AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_CJB110}     AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_COH1}       AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_M732}       AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_M781}       AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
msa21441.2{206_1169NT}     AATATCCCAG TAGCTGTCTA CAGTTATGCA CTTGGTTCAA GTGTTAAAGA
Consensus                  ********** ********** ********** ********** **********



                           351                                                400
msa21441.2{206_090}        AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_18RS21}     AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_2603}       AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_A909}       AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_H36B}       AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_JM9130013}  AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_CJB110}     AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_COH1}       AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_M732}       AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_M781}       AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
msa21441.2{206_1169NT}     AATGAAAGAA GAGGCTCAGA TATTTTATAA GAATGCAGCT CCTTACAAAC
Consensus                  ********** ********** ********** ********** **********



                           401                                                450
msa21441.2{206_090}        CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_18RS21}     CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_2603}       CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_A909}       CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_H36B}       CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_JM9130013}  CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_CJB110}     CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_COH1}       CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_M732}       CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_M781}       CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
msa21441.2{206_1169NT}     CAACTTTTTA TTGGATTGAC GTAGAAGAGG AGACAATGTC TAACATGAAT
Consensus                  ********** ********** ********** ********** **********



                           451                                                500
msa21441.2{206_090}        AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_18RS21}     AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_2603}       AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_A909}       AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_H36B}       AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_JM9130013}  AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_CJB110}     AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_COH1}       AAAGGTGTCC AAGCATTCCG AAAAGAgTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_M732}       AAAGGTGTCC AAGCATTCCG AAAAGAgTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_M781}       AAAGGTGTCC AAGCATTCCG AAAAGAgTTA AAAAGACTTG GtGCTAAAAA
msa21441.2{206_1169NT}     AAAGGTGTCC AAGCATTCCG AAAAGAaTTA AAAAGACTTG GcGCTAAAAA
Consensus                  ********** ********** ******_*** ********** *_********



                           501                                                550
msa21441.2{206_090}        TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_18RS21}     TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_2603}       TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_A909}       TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_H36B}       TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_JM9130013}  TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_CJB110}     TGTTGGTATC TACATtGGTA CTTACTTTAT GACTGAGCAA GGcATCTCTG
msa21441.2{206_COH1}       TGTTGGTATC TACATcGGTA CTTACTTTAT GACTGAGCAA GGtATCTCTG
msa21441.2{206_M732}       TGTTGGTATC TACATcGGTA CTTACTTTAT GACTGAGCAA GGtATCTCTG
msa21441.2{206_M781}       TGTTGGTATC TACATcGGTA CTTACTTTAT GACTGAGCAA GGtATCTCTG
msa21441.2{206_1169NT}     TGTTGGTATC TACATcGGTA CTTACTTTAT GACTGAGCAA GGtATCTCTG
Consensus                  ********** *****_**** ********** ********** **_*******



                           551                                                600
msa21441.2{206_090}        TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_18RS21}     TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_2603}       TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_A909}       TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_H36B}       TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_JM9130013}  TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_CJB110}     TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_COH1}       TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_M732}       TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_M781}       TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
msa21441.2{206_1169NT}     TAAAAGGATT TGACGCTGTT TGGATTCCAA CTTATGGTAG CGATTCTGGA
Consensus                  ********** ********** ********** ********** **********



                           601                                                650
msa21441.2{206_090}        TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_18RS21}     TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_2603}       TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_A909}       TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_H36B}       TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_JM9130013}  TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_CJB110}     TACTATGAAG CgGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_COH1}       TACTATGAAG CaGCTCCaCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_M732}       TACTATGAAG CaGCTCCaCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_M781}       TACTATGAAG CaGCTCCaCA AACTGAACTT AAATACGATT TACACCAATA
msa21441.2{206_1169NT}     TACTATGAAG CaGCTCCgCA AACTGAACTT AAATACGATT TACACCAATA
Consensus                  ********** *_*****_** ********** ********** **********



                           651                                                700
msa21441.2{206_090}        CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_18RS21}     CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_2603}       CACCTCTCAA GGTTATCTAC CAGGAwTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_A909}       CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_H36B}       CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_JM9130013}  CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_CJB110}     CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_COH1}       CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_M732}       CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_M781}       CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
msa21441.2{206_1169NT}     CACCTCTCAA GGTTATCTAC CAGGAtTCAA TCAACCGCTT GATTTAAATC
Consensus                  ********** ********** *****_**** ********** **********



msa21441.2{206_090} 
msa21441.2{206_18RS21> 
msa21441.2(206_2603) 
msa21441.2{206_A909} 
msa2 144 1 . 2 { 2 0 6_H3 6B } 
msa21441 . 2 { 206_JM9130013 } 
msa21441 . 2 {206_CJB110} 



701 

AAATTgCAGT 
AAATTgCAGT 
AAATTgCAGT 
AAATTgCAGT 
AAATTgCAGT 
AAATTgCAGT 
AAATTaCAGT 



TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 
TAATAAAGAC 



AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 
AAGAAGAAAA 



CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 
CTTATGAGAA 



750 

ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 
ACTTTTTGGA 



876 
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Table 53: Comparative Sequences relating to SAG 0755 



msa2 144 1 . 2 { 2 06_COH1 } 
msa21441.2{206__M732} 
msa21441.2{206_M78l} 
msa21441.2{206_1169NT} 

Consensus 



msa2 1441 . 2 { 206_090 / 
msa21441 . 2 { 206_18RS21 : 
msa21441.2{206_2603 ■ 
msa21441 .2{206_A909 
msa21441 . 2 { 206_H36B 
msa21441.2{206_xJM9130013 : 
msa21441 . 2 { 2 06_CJB110 
msa21441 .2{206_COH1 
msa21441.2(206_M732} 
msa21441.2{206_M781} 
msa21441 . 2 { 206_1169NT} 

Consensus 

SEQ ID NO. 5312 

STRAIN 2603 frame: 1 
TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
ISGWIRVFGGSKI S KTNNAAYTTG I DKS FKTHI KE FQKRNI PVAVYS YALGS S VKEMKE 
EAQI FYKNAA P Y KP T FYW I D VE EETMSNMN KGVQAFRKE LKRLGAKNVG I Y I GT Y FMTE Q 
GISVKGFDAWIPTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGXNQPIJ5I^QIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5313 
STRAIN 090 frame: 1 

TNTLKKELVEAKKT I PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRVFGGSKI S KTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYS YALGSSVKEMKE 
EAQI FYKNAAPYKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQ I AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5314 

STRAIN A909 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEIDYDTLSKN 
I SGWIRVFGGSKI S KTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSS VKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQ I AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5315 

STRAIN H36B frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
ISGWIRVFGGSKI S KTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYWIDVEEETMSNMNKGVQAFRKELKRIiGAKNVGI YI GTYFMTEQ 
GISVKGFDAVWIPTYGSDSGYYEAAPQTELKYDLHQYTSQ^YLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5316 

STRAIN 18RS21 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEIDYDTLSKN 
I SGWIRVFGGSKI S KTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YI GTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQI AVNKD 

KKKTYEKLFGKVKE 

SEQ ID NO. 5317 

STRAIN M732 frame: 1 

TNTLKKELVEAKKT I PS VKASKVPQKSTSSKDKEFVLKP I IDVSGWQLPKEI DYDTLSKN 
I SGWIRI FGGSKI S KTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYW I DVEEETMSNMNKGVQAFRKELKRLGAKNVG I YIGTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSCGYLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5318 

STRAIN COH1 frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRI FGGSKI S KTNNAAYTTG I DKS FKTHI KEFQKRNI PVAVYSYALGSSVKEMKE 
EAQI FYKNAAPYKPTFYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG! YIGTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQI AVNKD 
KKKTYEKLFGKVKE 

SEQ ID NO. 5319 

STRAIN M781 frame: 1 

TNTLKKELVEAKKTI PSVKASKVPQKSTSSKDKEFVLKPI IDVSGWQLPKEIDYDTLSKN 
I SGWIRI FGGSKI S KTNNAAYTTG I DKSFKTH I KEFQKRNI PVAVYS YAIX3SS VKEMKE 
EAQ I F YKNAAP YKPT FYWI DVEEETMSNMNKGVQAFRKELKRLGAKNVG I Y I GTYFMTEQ 
GI SVKGFDAVWI PTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD 
KKKTYEKLFGKVKE 



AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 
AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 
AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 
AAATTgCAGT TAATAAAGAC AAGAAGAAAA CTTATGAGAA ACTTTTTGGA 
*****_**** ********** ********** ********** *********** 

751       762
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
AAAGTAAAAG AG
********** **



Table 53: Comparative Sequences relating to SAG 0755



SEQ ID NO. 5320 
STRAIN CJB110 frame: 2 

NTLKKELVEAKKTIPSVKASKVPQKSTSSKDKEFVLKPIIDVSGWQLPKEIDYDTLSKNI
SGWIRVFGGSKISKTNNAAYTTGIDKSFKTHIKEFQKRNIPVAVYSYALGSSVKEMKEE
AQIFYKNAAPYKPTFYWIDVEEETMSNMNKGVQAFRKELKRLGAKNVGIYIGTYFMTEQG
ISVKGFDAVWIPTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQITVNKDK
KKTYEKLFGKVKE 

SEQ ID NO. 5321

STRAIN 1169NT frame: 1 

TNTLKKELVEAKKTIPSVKASKVPQKSTSSKDKEFVLKPIIDVSGWQLPKEIDYDTLSKN
ISGWIRVFGGSKISKTNNAAYTTGIDKSFKTHIKEFQKRNIPVAVYSYALGSSVKEMKE
EAQIFYKNAAPYKPTFYWIDVEEETMSNMNKGVQAFRKELKRLGAKNVGIYIGTYFMTEQ
GISVKGFDAVWIPTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD
KKKTYEKLFGKVKE 

SEQ ID NO. 5322 

STRAIN JM9130013 frame: 1 

TNTLKKELVEAKKTIPSVKASKVPQKSTSSKDKEFVLKPIIDVSGWQLPKEIDYDTLSKN
ISGWIRVFGGSKISKTNNAAYTTGIDKSFKTHIKEFQKRNIPVAVYSYALGSSVKEMKE
EAQIFYKNAAPYKPTFYWIDVEEETMSNMNKGVQAFRKELKRLGAKNVGIYIGTYFMTEQ
GISVKGFDAVWIPTYGSDSGYYEAAPQTELKYDLHQYTSQGYLPGFNQPLDLNQIAVNKD
KKKTYEKLFGKVKE 

PRETTY of: /biotmp/msa21641.2{*} January 20, 2003 03:59

                           1                                                   50
msa21641.2{206_090}        tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_1169NT}     tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_18RS21}     tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_2603}       tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_A909}       tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_H36B}       tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_JM9130013}  tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_COH1}       tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_M732}       tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_M781}       tNTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
msa21641.2{206_CJB110}     -NTLKKELVE AKKTIPSVKA SKVPQKSTSS KDKEFVLKPI IDVSGWQLPK
Consensus                  _********* ********** ********** ********** **********



                           51                                                 100
msa21641.2{206_090}        EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_1169NT}     EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_18RS21}     EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_2603}       EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_A909}       EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_H36B}       EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_JM9130013}  EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_COH1}       EIDYDTLSKN ISGWIRiFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_M732}       EIDYDTLSKN ISGWIRiFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_M781}       EIDYDTLSKN ISGWIRiFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
msa21641.2{206_CJB110}     EIDYDTLSKN ISGWIRvFG GSKISKTNNA AYTTGIDKSF KTHIKEFQKR
Consensus                  ********** *******_** ********** ********** **********



                           101                                                150
msa21641.2{206_090}        NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_1169NT}     NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_18RS21}     NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_2603}       NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_A909}       NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_H36B}       NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_JM9130013}  NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_COH1}       NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_M732}       NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_M781}       NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
msa21641.2{206_CJB110}     NIPVAVYSYA LGSSVKEMKE EAQIFYKNAA PYKPTFYWID VEEETMSNMN
Consensus                  ********** ********** ********** ********** **********



                           151                                                200
msa21641.2{206_090}        KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_1169NT}     KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_18RS21}     KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_2603}       KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_A909}       KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_H36B}       KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_JM9130013}  KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_COH1}       KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_M732}       KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_M781}       KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
msa21641.2{206_CJB110}     KGVQAFRKEL KRLGAKNVGI YIGTYFMTEQ GISVKGFDAV WIPTYGSDSG
Consensus                  ********** ********** ********** ********** **********






                           201                                                250
msa21641.2{206_090}        YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_1169NT}     YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_18RS21}     YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_2603}       YYEAAPQTEL KYDLHQYTSQ GYLPGxNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_A909}       YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_H36B}       YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_JM9130013}  YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_COH1}       YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_M732}       YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_M781}       YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQIaVNKD KKKTYEKLFG
msa21641.2{206_CJB110}     YYEAAPQTEL KYDLHQYTSQ GYLPGfNQPL DLNQItVNKD KKKTYEKLFG
Consensus                  ********** ********** *****_**** *****_**** **********



                           251
msa21641.2{206_090}        KVKE
msa21641.2{206_1169NT}     KVKE
msa21641.2{206_18RS21}     KVKE
msa21641.2{206_2603}       KVKE
msa21641.2{206_A909}       KVKE
msa21641.2{206_H36B}       KVKE
msa21641.2{206_JM9130013}  KVKE
msa21641.2{206_COH1}       KVKE
msa21641.2{206_M732}       KVKE
msa21641.2{206_M781}       KVKE
msa21641.2{206_CJB110}     KVKE
Consensus                  ****
SEQ ID NO. 5401
STRAIN 2603 

TTGACTCACAAAAATATATTATTAACCATTATATTTGGATTATTT 

ATGATTATATTATCAGCATGTGGTATGTCTAATAAGGAAATGGCTGGTATTGATAATTGG 
GAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAATACTTTTGTTCCTATG 
GGATTTGAAAGTCGTTCTGGTGACTATACCGGCTTTGATATTGATTTAGCTAATGCTGTT 
TTTAAAGAATACGGTATTTCAGTGAAATGGCAGCCTATTAACTGGGATATGAAAGAAACT 
GAACTTAAT AATGGT AAT ATAG AC CTTATT TG GAATGGTTATT CAAAAACGG CAG AACGT 
GCTAAAAAAGTCGCTTTTACAAACCCATATATGAATAATCATCAAGTAATTGTTACTAAA 
ACTT CAT CACATATTAATAGTATTAAGGAT ATGAAGG GGAAAAAACTAGGAG CCCAGTCG 
GGTTCATCTGGTTTTGATGCrTTTAACGCTAAACCTGATATTT^ 

GGAAAAG AAG CAGTT CIAAT ACG AT ACTTT CACT CAGG CTTTG ATTGATTTAAAAAATAAC 
CGTATTGATGGTCTTTTGATTGAT G AAGTTTATG CTAACTATTAT TT AAAG CAAG AAGGA 
AATATAAAAGCTTATTATTTTGTTAAAACTGCTTATCAAGGAGAAAATTTTGTAGTAGGA 
G CT CGT AAAGTTGAT CGTAGACTAATTGAAAAGAT TAACAAAG CTTT CAAACAG CTT CAT 
AATAAGGGGAGATTTCAAAAAATCTCTTACAAATGGTTTGGTGAAGATGTTTATAGTAAA 

GAA 

SEQ ID NO. 5402 

STRAIN 090 
ATTGGGaACATTATC 

AAAAGGAAAAGAAAATTACTATTGGATTTGATAATACTTTTGTTCCTATG 
GGATTTGAAAGC CGT T CIGGTG ACTAt AC CGG CTTTGATATTGATTTAG C 
TAATG CTGTTTTTAAAGAATACGGTATTT CAGTGAAATG G CAG CCTATT A 
ACTGGGATATGAAAGAAACTGAACTTAATAATGGTAATATAGACCTTATT 
TGGAATGGTTATT CAAAAACGGCAGAACGT G CTAAAAAAGTCG CTTTTAC 
AAACC CATATATGAATAAT CAT CAAGTAAT TGTTACTAAAACTT CAT CAC 
AT ATT AATAGTATTAAGGATATGAAGGGGAAAAAACTAGGAG CC CAGT CG 
GGTT CAT CTGGTTTTGATG CTTTT AATGCT AAACCT GATATTTTAAAAAA 
GTTTGTAAAAGGAAAAGAAG CAGTT CAATACGATACTTT CACTCAGG CTT 
TGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATTGATGAAGTT 
TATGCTAACTATTATTTAAAGCAAGAAGGAAATATAA^ 
TGTTAAAACT G CTTATCAAGGAGAAAATTTTGTAGTAGGAG CTCG CAAAG 
TTGAT CGTAGACrAATTGAAAAGATTAACAAAG CTTT CAAACAG CTT CAT 
AATAAGG GAAAATTT CAAAAAAT CT CTTACAAATGGTTTGGTG AAGATGT 
TTATAGTAAAGAA 

SEQ ID NO. 5403 

STRAIN A909 
ATTGGG 

aACATTATCAAAAGGAAAAGAAAATTACrATTGGATTTGATAATAC^ 

GTTCCTATGGGATTTGAAAGTCGTTOTGGTGACTATACCGGCTTTGATAT 

TGATTTAGCTAATGCTGTTTTTAAAGAATACGGTATTTCAG 

AG C CTTATTAAOTCGGATA t g AAAGAAACTG AACTTAATAATGGT AATAT A 

C4ACCTTATTTGGAATGGTTATT CAAAAACGG CIAGAACGTG CTAAAAAAGT 

CGCTTTTACAAACC CATATATGAATAAT CATCAAGTAATTGTTACTAAAA 

CTTCAT CACATATT AATAGTATTAAGGATATGAAGGGGAAAAAACTAGGA 

GCCCAGT CGGGTTCAT CTGGTTTTGATGCTTTTAA.CG CTAAACCTGATAT 

TTTAAAAAAGTTTGTAAAAGGAAAAGAAGCAGtTCAATACGATACTTTCA 

CT CAGG CTTTGATTGATTTAAAAAATAAC CGTATTGATG 

GATGAAGTTTATG CTAACTATTATTTAAAG CAAGAAGGAAATAT AAAAG C 

TTATT ATTTTGTTAAAACTG CTTAT CAAGGAGAAAATTTTGTAGTAGGAG 

CT CGTAAAGTTGAT CGTAG ACT AATTGAAAAGATTAACAAAG CTTT CAAA 

CAGCTTC^TAATAAGGGGAGATTTCAAAAAATCT^ 

TGAAGATGTTTATAGTAAAGaA 

SEQ ID NO. 5404 

STRAIN H36B 

ATTGGGAACATTAT CAAAAGGAAAAGAAAATTACT ATTGGATT 

TGATAATACTTTTGTTC CTATGGG ATTTG AAAGT CGTT CTGGTGACTATA 

CCG^CTT'TGATATTGATTTAGCTAATGCrrGTTTTTAAAGAATACGGTATT 

TCAGTGAAATGGCAGCCTATTAACTGGC^TATGAAAGA 

TAATGGT AATATAG ACCTTATTTGGAATGGTTATT CAAAAACGG CAG AAC 

GTGCTAAAAAAGTCG CTTTTACAAAC CCATAT ATG AATAAT CAT CAAGTA 

ATTGTTACTAAAACTTCAT CACAT ATTAATAGTATTAAG GAT ATGAAGG G 

GAAAAAACTACX3AGCC CAGT CGGGTTCAT CTGGTTTTGATG CTTTTAACG 

CTAAACCrGATATTTTAAAAAAGTTTGTAAAAGG AAAAG AAG CAG t T CAA 

TAOjATACTTTCACTCAGGCTTTGATTGATT^ 

XGGT CTTTTGATTGATGAAGT t TATG CTAACT ATT ATTT AAAGCAAG AAG 
GAAATATAAAAGCTTATTATTTTGTTAAAACTGCTTAT CAAGGAgAAA^T 
TTTGTAGTAGGAG CT CGTAAAGTTGATCGTAGACTAATTG AAAAGATTAA 
CAAAG CTTT CAAACAGCITCATAATAAGGGGAGATTT CAAAAAAT CT CTT 
ACAAATGGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ ID NO. 5405 

STRAIN 18RS21 
ATTGGGAACATTA 

T CAAAAGGAAAAGAAAATT ACT ATTGGATTTG ATAATAC^ 
TGGGATTTGAAAGTCGTTCTGGTGACTAtACCG^CTTTGATATTGATTTA 
GCTAATG CTGTTTTTAAAG AATACGGTATTT CAGTGAAATGG CAGCCTAT 
TAACTGGGATATGAAAGAAACTGAACTTAATAATGGTAATATAGACCTTA 
TTTGGAATGGTTATT CAAAAACGG CAGAACGTGCTAAAAAAGTCGCTTTT 
ACAAACCCATATATGAATAATCATCAAGTAATTGTTACTAAAACTTCATC 
ACATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAGGAGCCCAGT
CGGGTTCATCTGGTTTTGATGCTTTTAACGCTAAACCTGATATTTTAAAA 
AAGTTTGTAAAAGG AAAAG AAG CAGTT CAATACG ATACTTT CACTCAGG C 
TTTGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATTGATGAAG 
TTTATGCTAACTATTATTTAAAGCAAGAAGGAAATATAAAAGCTTATTAT 
TTTGTTAAAACT G CTTAT CAAGGAG AAAATTTTGT AGTAGGAG CTCGTAA 
AGTTGATCGTAGACTAATTGAAAAGATTAACAAAGCTTTCAAACAGCTTC 
AT AAT AAGG GGAGATTT CAAAAAAT CT CTTACAAATGGTTTG GTGAAGAT 
GTTTATAGTAAAGAA 

SEQ ID NO. 5406 

STRAIN M732 

ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAA 
TACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCT 
TTGATATTG ATTTAG C7T AA.TG CTGTTTTTAAAGAATACG GTATTT CAGTG 
AAATGGCAG C CTATTAACTG GGATATG AAAGAAACTGAACTTAATAATG G 
TAATATAGAC CTTATTTGGAAT GGTTATT CAAAAACGG CAGAACGTG CTA 
AAAAAGT CG CTTTTACAAAC CCAT ATATGAATAATCAT CAAGTAATTGTT 
ACTAAAACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAA 
ACTAGGAGCCa^GTCGGGTTCATCTGGTTTTGATGCTTTTAACGCTAAAC 
CTGATATTTTAAAAAAGTTTGTAAAAGGAAAAGAAGCAGTTCAATACGAT 
ACTTT Cl^CT CAGGCTTTGATTGATTTAAAAA^TAAC CGTATTGATGGT CT 
TTTGATTGATGAAGTTTATGCTAACTATTATTTAAAGCAAGAAGGAAAT 
TAAAAGCTTATTATTTTGTTAAAACTG CTTATCAAGG AGAAAATTTTGTA 
GTAGGAG CT CGTAAAGTTGATCGTAGACTAATTG AAAAG ATTAACAAAG C 
TTT CAAACAG CTTCATAATAAGGGGAG ATTTCAAAAAAT CTCTT^ 
GGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ ID NO. 5407 

STRAIN COH1 

ATTGGGAACATTAT CAAAAGGAAAAGAAAATTACTATTGGATTTGATAA 

TACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCT 

TTGATATTGATTTAGCTAATGCKSTTTTTAAAGAATAC^ 

AAATGG CAG CCTATTAACTGGGAT ATGAAAGAAACTGAACTTAAT AATGG 

TAATATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTA 

AAAAAGTCG CTTTTACAAAC CCAT ATATGAATAAT CATCAAGTAATTGTT 

ACTAAAACTT CAT CACATATTAATAGTATTAAGGATATGAAGGGG AAAAA 

ACTAGGAG C C CAGT CGGGTT CAT CTGGTTTTGATG CTTTTAACG CTAAAC 

CTGATATTTTAAAAA^GTTTGTAAAAGGAAAAGAAGCAGT 

ACTT T CACT CAGGCTTTGATTG ATTTAAAAAATAACCGTATTGATGGTCT 

TTTGATTGATGAAGTTTATG CTAACTATTATTTAAAGCAAGAAGGAAAT^ 

TAAAAGCTT ATT ATTTTGTTAAAACTG CTTAT CAAGGAGAAAATTTTGTA 

GTAGGAG CTCGTAAAGTTGAT CGTAGACTAATTGAAAAGATT AACAAAG C 

TTTGA^CAGCTTaA-TAATAAGGGGAGATTTCAAAAAATC 

GGTTTGGTGAAGATGTTTATAGTAAAGAA 

SEQ ID NO. 5408 

STRAIN M781 

ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATA 
ATACTTTTGTTC CTATGGGATTTGAAAGT CGTTCTGGTGACTATAC CGG C 
TTTGATATTGATTTAGCTAATG CTGTTTTTAAAG AAT ACGGTATTT CAGT 
GAAATGGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATG 
GTAATATAGACCTTATTTGGAATGGTTATT CAAAAACGG CAGAACGTG CT 
AAAAAAGT CG CTTTTACAAACCCATATATGAATAATCAT C AAGTAATTGT 
TACTAAAACTT CATCACATATTAATAGTATTAAGG ATATGAAGGGGAAAA 
AACTAGGAG C CCAGT CGGGTTCAT CTG GTTTTGATGCTTTTAACGCTAAA 
C CTGATATTTTAAAAAAGTTTGTAAAAGGAAAAGAAG CAGTT CAATACG A 
TACTTT CACT CAGG CTTTGATTGATTTAAAAAATAAC CGTATTG ATGGT C 
TTTTGATTGATGAAGTTTAT G CTAACTATTATTTAAAG CAAGAAGGAAAT 
ATAAAAGCITATTATTTTGTTAAAACrGCTTATCAAGG^ 
AGTAGGAGCT CGTAAAGTTGAT CGT AGACTAATTG AAA^GATTAACAAAG 
CTTT CAAACAG CTT CATAATAA3GGGAGATTT CAAAAAAT CT CTT ACAAA 
TGGTTTGGTGAAGATGTTTATAGTAAAGaA 

SEQ ID NO. 5409 

STRAIN CJB110 

ATTGGGAACATTAT CAAAAGGAAAAGAAAATTACTATTGGATTTG ATAAT 
ACTTTTGTT C CTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGG CTT 
TGATATTGATTTAG CTAATG CTGTTTTTAAAGAATACGGT ATTTCAGTGA 
AATGGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATGGT 
AATATAGAC CTT ATTTGGAATGGTT ATTCAAAAACGG CAGAACGTGCTAA 
AAAAGT CGCITTTACAAAC C CATATATGAATA^TCAT CAAGTAATTGTT A 
CTAAAACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAAA 
CTAGGAGCCCAGTCGGGTTCATCTGGTTTTGATGCITT^ 
TG AT ATTTT AAAAAAGTTTGTAAAAGG AAAAG AAG CAGTT CAAT ACGAT A 
CTTT CACTCAGG CTTTGATTGATTTAAAAAATAAC CGTATTGATGGTCTT 
TTGATTGATGAAGTTTATGCT'AACTATTATTTAAAGCAAG 
AAAAGCTTATTATTTTGTTAAAACTG CTTATCAAGGAGAAAATTTTGTAG 
TAGG AGCTCGTAAAGTT GAT CGTAG ACTAATTGAAAAGATTAACAAAG CT 
TT CAAACAG CTT CATAATAAGGGGAGATTT CAAAAAAT CT CTTACAAATG 
GTTT GGTGAAGATGTTTAT AGT AAAGAA 

SEQ ID NO. 5410 
STRAIN 1169NT

ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAA 
TACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCT 
TTGAT ATTGATTT AG CTAATGCTGTTTTTAAAGAATACGGT ATTT CAGTG 
AAATGG CAG C CT ATTAACTG GGAT ATG AAAGAAACTGAACTCAAT AATGG 
TAAT ATAGAC CTTATTTGG AATGGT T ATT CAAAAACGG CAGAAC GTG CT A 
AAAAAGTCGCTTTTACAAACCCATATATGAATAATCATCAAGTAATTGTT 
ACTAAAACTT CAT CACATATTAATAGTATTAAGG ATATG AAGGGG AAAAA 
ACTAGGAGCCCAGTCGGGTTCATCTGGTTTTGATGCTTTTAATGCTAAAC 
CTGACATTTTAAAAAAGTTrGTAAAAGGAAAAGAAGCAGTTCAATACGAT 
ACTTTCACTCAGGCTTTGATTGATTTAAAAAATAACCGTATTGATGGTCT 
TTTG ATTGATGAAGTTT ATG CTAACTATTATTTAAAG CAAGAAGGAAATA 
TAAAAG CTTATTATTTTGTT AAAACTG CTT ATCAAGGAGAAAATTTTGTA 
GTAGG AG CT CGCAAAGTTGATCGTAGACTAATTG AAAAG ATT AACAAAG C 
TTTCAAACAGCTTCLA.TAATAAGGGGAAATTTCAAA^AAT CTCTTACAAAT 
GGTTTGGTGAAGATGTTTATAGTAAAGAA 



SEQ ID NO. 5411 

STRAIN JM9130013

ATTGGGAACATTAT C 

AAAAGGAAAAGAAAA.TTACTATTGGATTTGATAATACTTTTGTTCCTATG 
GGATTTGAAAGTO^TTCTGGTGACTAtACCGGCTTTGATATTGATTTAGC 
TAATG CTGTTTTTAAAGAATACGGT ATTT CAGTGAAATGG CAGC CTATTA 
ACTGGGATATGAAAGAAACrGAACTTAATAATGGTAATATAGACCTTATT 
TGGAATGGTTATTCAAAAACGG CAGAACGTGCTAAAAAAGTCG C'iTTT AC 
AAAC C CATATATGAATAAT CAT CAAGTAATTGTTACTAAAACTTCATCAC 
ATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAGGAGCCCAGTCG 
GGTT CAT CTGGTTTTGATGCTTTTAACG CTAAAC CTICATATTTTAAAAAA 
GTTTGTAAAAGGAAAAGAAG CAGTTCAATACGATACITTCACTCAGG CTT 
TGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATTGAT 
TATG CTAACTATTATTTAAAG CAAGAAGGAAATAT AAAAG CTTATTATTT 
TGTTAAAACTGCTTATCAAGGAGAAAATTTTGTAGTAGGAG CTCGTAAAG 
TTGAT CGTAGACTAATTGAAAAGATTAACAAAGCTTT CAAACAG CTT CAT 
AATAAGGGGAGATTT CAAAAAAT CT CTTACAAATGGTTTGGTGAAGATGT 
TTATAGTAAAGAA 

PRETTY of: /biotmp/msa39314.2{*} February 18, 2003 11:01
Consensus



50 



ttgactcaca aaaatatatt attaaccatt atatttggat tatttatgat 



********** ********** ********** ********** ********** 

51 100 
tatattatca gcatgtggta tgtctaataa ggaaatggct ggtattgata 



********** ********** ********** ********** ********** 



                           101                                                150
msa39314.2{225_18RS21}     ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_2603}       ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_A909}       ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_CJB110}     ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_COH1}       ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_H36B}       ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_KM9130013}  ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_M732}       ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_M781}       ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_090}        ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
msa39314.2{225_1169NT}     ATTGGGAACA TTATCAAAAG GAAAAGAAAA TTACTATTGG ATTTGATAAT
Consensus                  ********** ********** ********** ********** **********



                           151                                                200
msa39314.2{225_18RS21}     ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_2603}       ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_A909}       ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_CJB110}     ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_COH1}       ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_H36B}       ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_KM9130013}  ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_M732}       ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_M781}       ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_090}        ACTTTTGTTC CTATGGGATT TGAAAGcCGT TCTGGTGACT ATACCGGCTT
msa39314.2{225_1169NT}     ACTTTTGTTC CTATGGGATT TGAAAGtCGT TCTGGTGACT ATACCGGCTT
Consensus                  ********** ********** ******_*** ********** **********



                           201                                                250
msa39314.2{225_18RS21}     TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_2603}       TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_A909}       TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_CJB110}     TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_COH1}       TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_H36B}       TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_KM9130013}  TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_M732}       TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_M781}       TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_090}        TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
msa39314.2{225_1169NT}     TGATATTGAT TTAGCTAATG CTGTTTTTAA AGAATACGGT ATTTCAGTGA
Consensus                  ********** ********** ********** ********** **********



                           251                                                300
msa39314.2{225_18RS21}     AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_2603}       AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_A909}       AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_CJB110}     AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_COH1}       AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_H36B}       AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_KM9130013}  AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_M732}       AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_M781}       AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_090}        AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT tAATAATGGT
msa39314.2{225_1169NT}     AATGGCAGCC TATTAACTGG GATATGAAAG AAACTGAACT cAATAATGGT
Consensus                  ********** ********** ********** ********** _*********



                           301                                                350
msa39314.2{225_18RS21}     AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_2603}       AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_A909}       AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_CJB110}     AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_COH1}       AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_H36B}       AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_KM9130013}  AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_M732}       AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_M781}       AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_090}        AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
msa39314.2{225_1169NT}     AATATAGACC TTATTTGGAA TGGTTATTCA AAAACGGCAG AACGTGCTAA
Consensus                  ********** ********** ********** ********** **********



                           351                                                400
msa39314.2{225_18RS21}     AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_2603}       AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_A909}       AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_CJB110}     AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_COH1}       AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_H36B}       AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_KM9130013}  AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_M732}       AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_M781}       AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_090}        AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
msa39314.2{225_1169NT}     AAAAGTCGCT TTTACAAACC CATATATGAA TAATCATCAA GTAATTGTTA
Consensus                  ********** ********** ********** ********** **********



                           401                                                450
msa39314.2{225_18RS21}     CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_2603}       CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_A909}       CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_CJB110}     CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_COH1}       CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_H36B}       CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_KM9130013}  CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_M732}       CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_M781}       CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_090}        CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
msa39314.2{225_1169NT}     CTAAAACTTC ATCACATATT AATAGTATTA AGGATATGAA GGGGAAAAAA
Consensus                  ********** ********** ********** ********** **********



WO 2004/018646 PCT/US2003/026827

Table 54: Comparative Sequences relating to SAG0949

                           451                                                500
msa39314.2{225_18RS21}     CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_2603}       CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_A909}       CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_CJB110}     CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_COH1}       CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_H36B}       CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_KM9130013}  CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_M732}       CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_M781}       CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AcGCTAAACC
msa39314.2{225_090}        CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AtGCTAAACC
msa39314.2{225_1169NT}     CTAGGAGCCC AGTCGGGTTC ATCTGGTTTT GATGCTTTTA AtGCTAAACC
Consensus                  ********** ********** ********** ********** *_********



                           501                                                550
msa39314.2{225_18RS21}     TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_2603}       TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_A909}       TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_CJB110}     TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_COH1}       TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_H36B}       TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_KM9130013}  TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_M732}       TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_M781}       TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_090}        TGAtATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
msa39314.2{225_1169NT}     TGAcATTTTA AAAAAGTTTG TAAAAGGAAA AGAAGCAGTT CAATACGATA
Consensus                  ***_****** ********** ********** ********** **********



                           551                                                600
msa39314.2{225_18RS21}     CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_2603}       CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_A909}       CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_CJB110}     CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_COH1}       CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_H36B}       CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_KM9130013}  CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_M732}       CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_M781}       CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_090}        CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
msa39314.2{225_1169NT}     CTTTCACTCA GGCTTTGATT GATTTAAAAA ATAACCGTAT TGATGGTCTT
Consensus                  ********** ********** ********** ********** **********



                           601                                                650
msa39314.2{225_18RS21}     TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_2603}       TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_A909}       TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_CJB110}     TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_COH1}       TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_H36B}       TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_KM9130013}  TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_M732}       TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_M781}       TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_090}        TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
msa39314.2{225_1169NT}     TTGATTGATG AAGTTTATGC TAACTATTAT TTAAAGCAAG AAGGAAATAT
Consensus                  ********** ********** ********** ********** **********



                           651                                                700
msa39314.2{225_18RS21}     AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_2603}       AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_A909}       AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_CJB110}     AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_COH1}       AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_H36B}       AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_KM9130013}  AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_M732}       AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_M781}       AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_090}        AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
msa39314.2{225_1169NT}     AAAAGCTTAT TATTTTGTTA AAACTGCTTA TCAAGGAGAA AATTTTGTAG
Consensus                  ********** ********** ********** ********** **********



                           701                                                750
msa39314.2{225_18RS21}     TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_2603}       TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_A909}       TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_CJB110}     TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_COH1}       TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_H36B}       TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_KM9130013}  TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_M732}       TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_M781}       TAGGAGCTCG tAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_090}        TAGGAGCTCG cAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
msa39314.2{225_1169NT}     TAGGAGCTCG cAAAGTTGAT CGTAGACTAA TTGAAAAGAT TAACAAAGCT
Consensus                  ********** _********* ********** ********** **********



751 



800 






msa39314.2{225_18RS21}
msa39314.2{225_2603}
msa39314.2{225_A909}
msa39314.2{225_CJB110}
msa39314.2{225_COH1}
msa39314.2{225_H36B}
msa39314.2{225_KM9130013}
msa39314.2{225_M732}
msa39314.2{225_M781}
msa39314.2{225_090}
msa39314.2{225_1169NT}

Consensus 



msa39314.2{225_18RS21}
msa39314.2{225_2603}
msa39314.2{225_A909}
msa39314.2{225_CJB110}
msa39314.2{225_COH1}
msa39314.2{225_H36B}
msa39314.2{225_KM9130013}
msa39314.2{225_M732}
msa39314.2{225_M781}
msa39314.2{225_090}
msa39314.2{225_1169NT}

Consensus 

SEQ ID NO. 5412 

STRAIN 2603 frame: 1 
LTHKNILLTIIFGLFMIILSACGMSNKEMAGIDNWEHYQKEKKITIGFDNTFVPMGFESR
SGDYTGFDIDLANAVFKEYGISVKWQPINWDMKETELNNGNIDLIWNGYSKTAERAKKVA
FTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQSGSSGFDAFNAKPDILKKFVKGKEAV
QYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQEGNIKAYYFVKTAYQGENFVVGARKVD
RRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYSKE

SEQ ID NO. 5413 

STRAIN 090 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGKFQKISYKWFGEDVYS

KE 

SEQ ID NO. 5414 

STRAIN A909 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE 
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS
KE 

SEQ ID NO. 5415 

STRAIN H36B frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS
KE 

SEQ ID NO. 5416 

STRAIN 18RS21 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS
KE 

SEQ ID NO. 5417 

STRAIN M732 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE 
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS
KE 

SEQ ID NO. 5418 

STRAIN COH1 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS
KE 



TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 
TTCAAACAGC 

TTCAAACAGC 
********** 



TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 
TTCATAATAA 

TTCATAATAA 
********** 



GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGgAgATTT 
GGGaAaATTT 
GGGgAaATTT 



CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 
CAAAAAATCT 



CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 
CTTACAAATG 



***_*_**** ********** ********** 



801 

GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 
GTTTGGTGAA 



GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 
GATGTTTATA 



********** ********** 



828 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 

GTAAAGAA 
******** 






SEQ ID NO. 5419

STRAIN M781 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS

KE 

SEQ ID NO. 5420 

STRAIN CJB110 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS

KE 

SEQ ID NO. 5421 

STRAIN 1169NT frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGKFQKISYKWFGEDVYS

KE 



SEQ ID NO. 5422 

STRAIN JM9130013 frame: 3 

WEHYQKEKKITIGFDNTFVPMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKE
TELNNGNIDLIWNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQ
SGSSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANYYLKQE
GNIKAYYFVKTAYQGENFVVGARKVDRRLIEKINKAFKQLHNKGRFQKISYKWFGEDVYS
KE 

PRETTY of: /biotmp/msa45901.2{*} February 19, 2003 03:09



msa45901.2{225_090} 
msa45901.2{225_1169NT} 
msa45901.2{225_18RS21}
msa45901.2{225_2603}
msa45901.2{225_A909}
msa45901.2{225_CJB110}
msa45901.2{225_COH1}
msa45901.2{225_H36B}
msa45901.2{225_JM9130013}
msa45901.2{225_M732}
msa45901.2{225_M781}
Consensus 



x 50 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

lthknillti ifglfmiils acgmsnkema gidnWEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

WEHYQK EKKITIGFDN 

********** ********** ********** ********** ********** 



msa45901.2{225_090}
msa45901.2{225_1169NT}
msa45901.2{225_18RS21}
msa45901.2{225_2603}
msa45901.2{225_A909}
msa45901.2{225_CJB110}
msa45901.2{225_COH1}
msa45901.2{225_H36B}
msa45901.2{225_JM9130013}
msa45901.2{225_M732}
msa45901.2{225_M781}
Consensus



msa45901.2{225_090}
msa45901.2{225_1169NT}
msa45901.2{225_18RS21}
msa45901.2{225_2603}
msa45901.2{225_A909}
msa45901.2{225_CJB110}
msa45901.2{225_COH1}
msa45901.2{225_H36B}
msa45901.2{225_JM9130013}
msa45901.2{225_M732}
msa45901.2{225_M781}
Consensus 



msa45901.2{225_090}
msa45901.2{225_1169NT}
msa45901.2{225_18RS21}
msa45901.2{225_2603}
msa45901.2{225_A909}
msa45901.2{225_CJB110}



51 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 

TFVPMGFESR 
********** 

101 

NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
NIDLIWNGYS 
********** 

151 

LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 



SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
SGDYTGFDID 
********** 



LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 
LANAVFKEYG 



ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 
ISVKWQPINW 



100 

DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 
DMKETELNNG 



********** ********** ********** 



KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 

KTAERAKKVA 
********** 



FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 
FTNPYMNNHQ 



VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 
VIVTKTSSHI 



********** ********** 



DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL



KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 



QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 



150 

NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
NSIKDMKGKK 
********** 

200 

DLKNNRIDGL
DLKNNRIDGL
DLKNNRIDGL
DLKNNRIDGL
DLKNNRIDGL
DLKNNRIDGL






msa45901.2{225_COH1}
msa45901.2{225_H36B}
msa45901.2{225_JM9130013}
msa45901.2{225_M732}
msa45901.2{225_M781}
Consensus



msa45901.2{225_090}
msa45901.2{225_1169NT}
msa45901.2{225_18RS21}
msa45901.2{225_2603}
msa45901.2{225_A909}
msa45901.2{225_CJB110}
msa45901.2{225_COH1}
msa45901.2{225_H36B}
msa45901.2{225_JM9130013}
msa45901.2{225_M732}
msa45901.2{225_M781}
Consensus 



msa45901.2{225_090}
msa45901.2{225_1169NT}
msa45901.2{225_18RS21}
msa45901.2{225_2603}
msa45901.2{225_A909}
msa45901.2{225_CJB110}
msa45901.2{225_COH1}
msa45901.2{225_H36B}
msa45901.2{225_JM9130013}
msa45901.2{225_M732}
msa45901.2{225_M781}
Consensus



LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 
LGAQSGSSGF 



DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL
DAFNAKPDIL



KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 
KKFVKGKEAV 



QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 
QYDTFTQALI 



DLKNNRIDGL 
DLKNNRIDGL 
DLKNNRIDGL 
DLKNNRIDGL 
DLKNNRIDGL 



********** ********** ********** ********** ********** 



201 

LIDEVYANYY
LIDEVYANYY
LIDEVYANYY
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 
LIDEVYANYY 



LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 
LKQEGNIKAY 



YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 
YFVKTAYQGE 



********** ********** ********** 



251 

FKQLHNKGkF 
FKQLHNKGkF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 
FKQLHNKGrF 



QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 
QKISYKWFGE 



276 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 
DVYSKE 



NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD

NFVVGARKVD
********** 



250 

RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
RRLIEKINKA 
********** 



********_* ********** ****** 
Table 55: Comparative Sequences relating to SAG1592



SEQ ID NO. 5501
STRAIN 2603 

ATGCTTAAATCTTTTTTGATTTTCTTAGTTCGCTTTTACCAAAAAAATATTTCTCCAGCT
TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGAAGCTATTCAA
AAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTATTTTGCGATGTCATCCCTTA
GCCCACGGAGGAAATGATCCTGTCCCTGATCATTTTAGCTTAAGACGTAATAAAACGGAT
ATATCAGAT

SEQ ID NO. 5502 

STRAIN 090 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA 
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA 
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT 

CATTTTAGCTT

SEQ ID NO. 5503 
STRAIN A909 

TTCCCAGCTAGCTGTCGTTATCGTCCAACtTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT 
CATTTTAgCTTAAGACGTAATAAAACGGATATA 

SEQ ID NO. 5504 

STRAIN H36B 

TTCCCAGCTAGCTGTCGTTATCGTCCaACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTTCTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT

SEQ ID NO. 5505 

STRAIN 18RS21 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT 

SEQ ID NO. 5506 

STRAIN M732 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAgCCCACGGAGGAAATGATCCTGTCCCTGAT 
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT 

SEQ ID NO. 5507 

STRAIN COH1 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGAAGCTATTCAA
AAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTATTTTGCGATGTCATCCCTTA
GCCCACGGAGGAAATGAtCCTGtCCCTGATCATTTTAGCT 

SEQ ID NO. 5508 

STRAIN M781 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT 

SEQ ID NO. 5509 

STRAIN CJB110 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTGTTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT

SEQ ID NO. 5510 

STRAIN 1169NT 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTGGTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT
TATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT

SEQ ID NO. 5511 

STRAIN JM9130013 

TTCCCAGCTAGCTGTCGTTATCGTCCAACTTGCTCTACGTATATGATAGA
AGCTATTCAAAAACATGGTCTAAAAGGTGTTCTGATGGGGATTGCACGTA
TTTTGCGATGTCATCCCTTAGCCCACGGAGGAAATGATCCTGTCCCTGAT
CATTTTAGCTTAAGACGTAATAAAACGGATATATCAGAT



PRETTY of: /biotmp/msa119306.2{*} April 29, 2003 06:23 ..

1 50 

msa119306.2{233_H36B}
msa119306.2{233_JM9130013}






msa119306.2{233_090}
msa119306.2{233_18RS21}
msa119306.2{233_2603}
msa119306.2{233_A909}
msa119306.2{233_CJB110}
msa119306.2{233_COH1}
msa119306.2{233_M732}
msa119306.2{233_M781}
msa119306.2{233_1169NT}
Consensus



atgcttaaat cttttttgat tttcttagtt cgcttttacc aaaaaaatat 



msa119306.2{233_H36B}
msa119306.2{233_JM9130013}
msa119306.2{233_090}
msa119306.2{233_18RS21}
msa119306.2{233_2603}
msa119306.2{233_A909}
msa119306.2{233_CJB110}
msa119306.2{233_COH1}
msa119306.2{233_M732}
msa119306.2{233_M781}
msa119306.2{233_1169NT}
Consensus



msa119306.2{233_H36B}
msa119306.2{233_JM9130013}
msa119306.2{233_090}
msa119306.2{233_18RS21}
msa119306.2{233_2603}
msa119306.2{233_A909}
msa119306.2{233_CJB110}
msa119306.2{233_COH1}
msa119306.2{233_M732}
msa119306.2{233_M781}
msa119306.2{233_1169NT}
Consensus



msa119306.2{233_H36B}
msa119306.2{233_JM9130013}
msa119306.2{233_090}
msa119306.2{233_18RS21}
msa119306.2{233_2603}
msa119306.2{233_A909}
msa119306.2{233_CJB110}
msa119306.2{233_COH1}
msa119306.2{233_M732}
msa119306.2{233_M781}
msa119306.2{233_1169NT}
Consensus



msa119306.2{233_H36B}
msa119306.2{233_JM9130013}
msa119306.2{233_090}
msa119306.2{233_18RS21}
msa119306.2{233_2603}
msa119306.2{233_A909}
msa119306.2{233_CJB110}
msa119306.2{233_COH1}
msa119306.2{233_M732}
msa119306.2{233_M781}
msa119306.2{233_1169NT}
Consensus



********** ********** ********** ********** ********** 



51 

TTCCCAGCTA

TTCCCAGCTA 

TTCCCAGCTA 

TTCCCAGCTA 

ttctccagct TTCCCAGCTA 

TTCCCAGCTA 

TTCCCAGCTA 

TTCCCAGCTA 

TTCCCAGCTA 

TTCCCAGCTA 

TTCCCAGCTA 

********** ********** ********** ********** ********** 



GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 
GCTGTCGTTA 



TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 
TCGTCCAACT 



100 

TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 
TGCTCTACGT 



101 

ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 
ATATGATAGA 



AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 
AGCTATTCAA 



AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 
AAACATGGTC 



********** ********** ********** 



151 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 

ATTGCACGTA 
********** 

201 

TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 
TGTCCCTGAT 

TGTCCCTGAT 
********** 



TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 

TTTTGCGATG 
********** 



TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
TCATCCCTTA 
********** 



TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
TAAAAGGTGT 
********** 



GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 

GCCCACGGAG 
********** 



150 

tcTGATGGGG 

tcTGATGGGG 

gtTGATGGGG

gtTGATGGGG

gtTGATGGGG

gtTGATGGGG 

gtTGATGGGG 

gtTGATGGGG 

gtTGATGGGG 

gtTGATGGGG 

ggTGATGGGG 
_ _******** 

200 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 

GAAATGATCC 
********** 



249 

cATTTTAGCT taagacgtaa taaaacggat atatcagat 
cATTTTAGCT taagacgtaa taaaacggat atatcagat 

cATTTTAGCT t

cATTTTAGCT taagacgtaa taaaacggat atatcagat 
cATTTTAGCT taagacgtaa taaaacggat atatcagat 

cATTTTAGCT taagacgtaa taaaacggat ata

cATTTTAGCT taagacgtaa taaaacggat atatcagat 

cATTTTAGCT

cATTTTAGCT taagacgtaa taaaacggat atatcagat 
cATTTTAGCT taagacgtaa taaaacggat atatcagat 
tATTTTAGCT taagacgtaa taaaacggat atatcagat 
_********* — 



SEQ ID NO. 5512

STRAIN 2603 frame: 1 

MLKSFLIFLVRFYQKNISPAFPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPL
AHGGNDPVPDHFSLRRNKTDISD 

SEQ ID NO. 5513 

STRAIN 090 frame: 1 

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFS

SEQ ID NO. 5514 

STRAIN A909 frame: 1 

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
I 

SEQ ID NO. 5515 

STRAIN H36B frame: 1 

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
ISD 

SEQ ID NO. 5516 
STRAIN 18RS21 frame: 1 

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
ISD 



SEQ ID NO. 5517 

STRAIN M732 frame: 1 

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
ISD 

SEQ ID NO. 5518

STRAIN COH1 frame: 1

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFS



SEQ ID NO. 5519

STRAIN M781 frame: 1

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
ISD 



SEQ ID NO. 5520
STRAIN CJB110 frame: 1

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
ISD 

SEQ ID NO. 5521

STRAIN 1169NT frame: 1

FPASCRYRPTCSTYMIEAIQKHGLKGVVMGIARILRCHPLAHGGNDPVPDYFSLRRNKTD
ISD 



SEQ ID NO. 5522 

STRAIN JM9130013 frame: 1 

FPASCRYRPTCSTYMIEAIQKHGLKGVLMGIARILRCHPLAHGGNDPVPDHFSLRRNKTD
ISD 

PRETTY of: /biotmp/msa119415.2{*} April 29, 2003 06:25 ..



msa119415.2{233_090}
msa119415.2{233_18RS21}
msa119415.2{233_COH1}
msa119415.2{233_A909}
msa119415.2{233_2603}
msa119415.2{233_CJB110}
msa119415.2{233_H36B}
msa119415.2{233_JM9130013}
msa119415.2{233_M732}
msa119415.2{233_M781}
msa119415.2{233_1169NT}
Consensus



FPASCRYRPT

FPASCRYRPT 

FPASCRYRPT 

FPASCRYRPT 

mlksfliflv rfyqknispa FPASCRYRPT

FPASCRYRPT 

FPASCRYRPT 

FPASCRYRPT 

FPASCRYRPT 

FPASCRYRPT 

FPASCRYRPT 



CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 
CSTYMIEAIQ 



50 

KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVlMG
KHGLKGVvMG 



********** ********** ********** ********** *******_** 



msa119415.2{233_090}
msa119415.2{233_18RS21}
msa119415.2{233_COH1}
msa119415.2{233_A909}
msa119415.2{233_2603}
msa119415.2{233_CJB110}
msa119415.2{233_H36B}
msa119415.2{233_JM9130013}
msa119415.2{233_M732}
msa119415.2{233_M781}
msa119415.2{233_1169NT}
Consensus



51 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 

IARILRCHPL 
********** 



AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 

AHGGNDPVPD 
********** 



83 

hFS 

hFSLRRNKTD ISD 

hFS 

hFSLRRNKTD I — 

hFSLRRNKTD ISD 

hFSLRRNKTD ISD 

hFSLRRNKTD ISD 

hFSLRRNKTD ISD 

hFSLRRNKTD ISD 

hFSLRRNKTD ISD 

yFSLRRNKTD ISD 
_********* *** 
Table 56: Comparative Sequences relating to SAG0806



SEQ ID NO. 5601 
STRAIN 2603 

aagaagcttacttttatttgggatttagatgggacattaatagattcgta 
tgtaccaattatggaagctcttgaagaaacctatcgtcattttggtttaa 
tatttgataaagaattaatccatgaatatattttacaggaatcagtgggg 
aaattattggtaaacctttcagaggaagagcaaatacctcatgaaaaact 
gaaagcatattttacaaaagaacaagaaagtcgagattctaaaatacatt 
taatgccatatgcaaaagagattttagaatggaccaaagaacaagatatc 
cccaattttatgtatacacataaaggagcaagtacgcattcagtgttgga 
aaccttgcagatctctcattattttgatgaaattttaactggtgtttcgg 
gattcgagcgaaaaccacatccacaagggattaattatttagttaaacga 
tattctttagataaatcaatgacttattacataggagatcgtccactaga 
tttggaggttgctcaaaatgctggtataaaatccataaacttaaggttag 
agaattccaaagaaaactataatatttcaagtctcaaagatataatatca 
cttgatttcactcgtttggat 

SEQ ID NO. 5602 
STRAIN COH1 

AAGAAGCTTACTTTTATTTGGGATTTAGATGGGACATTAA 
TAGATTCGTATGTACCAATTATGGAAGCTCTTGAAGAAAC CTAT CGTCAT 
TTTGGCTTAATATTTGAT AAAGAATTAAT C CATG AATATATTTT ACAGGA 
AT CAGTGGGG CAATTATTGGTAAACCTTT CAGAGG AAGAG CAAATACCT C 
ATGAAAAACTGAAAG CATATTTTACAAAAGAACAAGAAAGT CGAGATTCT 
AAAATACATTTAATGCCATATGCAAAAGAGATTTTAGAATGGACCAAAGA 
ACAAGATATTCCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATT 
CAGTGTTGGAAACCTTG CAGATCT CTCATT ATTTTGAT^ 
GGTGTTTCGGGATT CGAG CGAAAAC CACATCCACAAGGGATTAATTATTT 
AGTTAAACX3ATATTCTTTAGATAAATGAATGACITATTACATAGGAGATC 
GTCCACTAGATTTGGAGGTTGCTCAAAATGCTGGTATAAAATCCATAAAC 
TT AAGGTTAGAGAATTC CAAAGAAAACTATAATATTT CAAGTCTCAAAG A 
TATAATAT CACTTGATTT CACT CGTTTGGAT 

SEQ ID NO. 5603 
STRAIN A909 

AAGAAG CTTACrTTTATTTGGGATTTAGATGGGACATTAAT 
AGATT CX3TATGTAC CAATTATGGAAGCTCTTG AAC^ 

ATTTGATAAAGAATTAAT C CATGAATATATTTTACAGGAATCAGTGGGGAAATT ATTGGT 
AAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACTGAAAG CATATTTTACAAAAGA 
ACAAGAAAGT CGAGATT CITAAAATAG^TTT AATG CCATATG CAAAAGAG ATTTTAGAATG 
GACCAAAGAACAAGATATC C CCAATTTTATGTATACACAT AAAGGAG CAAGTACG CATT C 
AGTGTTGGAAACCrTGCAGATCTCTCATTATTTTGATGAAATT^ 

ATTCGAG CGAAAAC CACAT C CAC!AAGGGATTAATT ATTTAGTTAAACGATATTCTTT AG A 
TAAAT CAATGACTTATTAGA.TAGGAGATCGTCCACTAGATTTGGAGGTTG CT CAAAATG C 
TGGTATAAAAT CCAT AAACTTAAGGTT AGAGAATT CCAAAGAAAACT ATAATATTT CAAG 
TCTCAAAGATATAATATCACTTGATTTCACTCGT 

SEQ ID NO. 5604 
STRAIN H36B 

AAGAAGCTTACITTTATTTGGGATTTAGATGGGACATTAATAGATTCG 

TATGTACCAATTATGGAAGCTCITGAAGA 

AAAGAATTAAT CCATGAATATATTTTACAGGAATCAGTGGGGAAATTATTGGTAAAC CTT 
TCAGAGGAAG AG CAAATAC CTCATGAAAAACTGAAAGCATATTTTACAAAAGAACAAGAA 
AGTCGAGATTCTAAAATACATTTAATCCCATATGCAAAAGAGATTT^ 

GAACIAAGATATCCCCAATTTTATGTATACACATAA 

GAAACCTTGCAGATCTCTCATTATTTTGATGAAATTTTAACTGGTGT^ 

CGAAAAC CACAT CCACAAGGGATTAATTATTTAGTTAAACGATATT CTTTAGATAAATCA 

ATGACTTATTACAT AGG AG ATCGT C CACTAGATTTGGAGGTTGCT CAAAATGCTGGTATA 

AAAT C CAT AAACTT AAGGTTAGAG AATT C CAAAGAAAACTATAATATTT CAAGT CT CAAA 

GATATAATAT CACTTGATTT CACT CGTTTGGAT 

SEQ ID NO. 5605 
STRAIN 18RS21 

AAGAAG CTT ACTTTT ATTTGGG ATTTAGATGGGACLATTAATAGA^ 
CGTATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGTT^ 
ATAAAGAATT AATCCATGAATATATTTTACAGGAAT CAGTGGGG AAATT ATTGGT AAAC C 
TTTCAGAGG AAGAG CAAATACCTCATG AAAAACTGAAAG CAT ATTTTACAAAAG AACAAG 
AAAGT CGAGATT CTAAAATACATTTAATG C CATATG CAAAAG AGATTTT AGAATGGACCA 
AAGAACAAG ATATC C CCAATTTTATGTATACACATAAAGGAG CAAGTACG CATTCAGTGT 
TGGAAACCTTGCAGATCTCTCATTATTT^ 

AG CGAAAAC CACATC CACAAGGGATTAATT ATTTAGTTAAACGAT ATT CTTTAGATAAAT 
CAATGACTT ATTACATAGGAGAT CGT C CACTAGATTTGGAGGTTGCT CAAAATG CTGGT A 
TAAAAT C CATAAACTTAAGGTTAG AGAATT CCAAAGAAAACTAT AATATTTCAAGTOT 
AAGATATAATATCACTTGATTTCACTCGTTTGGAT 

SEQ ID NO. 5606 
STRAIN M732 

AAGAAGCTTACTTTTATTTGC4GATTTAGATGGGACATTAATAGAT 

TCGTATGTAC CAATT ATGGAAG CT CTTGAAGAAAC CT ATCGT CATTTTGG CTTAATATTT 
GATAAAG AATTAAT CCATGAATATATTTT ACAGGAATCAGTGGG G CAATTATTGGT A AAC 
CTTT CAGAGG AAGAG CAAATAC CT CATGAAAAACTGAAAG CATATTTTACAAAAGAACAA 
GAAAGTCGAGATT CTAAAAT ACATTTAATG C CAT ATGCAAAAGAGATTTTAG AATGGAC C 
AAAGAACAAGATATTCCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATTCAGTG 
TTGGAAACCTTGCAGATCTCTCATTATTTTGATGAAATTTTAACTGGTGTTTCGGGATTC
GAGCGAAAACCACATCCACAAGGGATTAATTATTTAGTTAAACGATATTCTTTAGATA^ 
TCAATGACTTATTACATAGGAGATCGTCCACTAGATTTGGAGGTTGCTCAAAATGCTGGT 
AT AAAAT C CATAAACTT AAGGTTAG AG AATT C CAAAG AAAAC TATAAT AT TTCAAGT CT C 
AAAGATATAATATCACTTGATTTCACTCGTTTGGAT 

SEQ ID NO. 5607 
STRAIN CJB110

AAGAAG CTTACTTTTATTTGGG ATTTAGAT GG GACATT 

AATAGATTCGTATGT AC CAATT ATGGAAG CT CTTG AAGAAAC CTAT CGT CATTTTGG CTT 
AATATTTGATAAAG AATTAAT C CATG AATATATTTTACAG GAAT CAGTGGGG CAATTATT 
GGTAAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACTGAAAGCAT^^ 
AGAACAAGAAAGTCGAGATTCTAAAATACATTTAATGCCATAT 

. ATGG ACCAAAGAACAAG ATAT C CC CAATTTTATGTAT ACACATAAAGGAG CAAGTACG CA 
TTCAGTGTTGGAAACCnTGCAGATCTCTCATTATTTTGATGAA 
TGGATTCGAGCGAAAA.CCA-CATCCACAAGGGATTAATTATTTAGTTAAAC 
AGATAAAT CAATG ACTTATT ACATAGGAG AT CGT CCC CT AGATTTGGAG GTTGCTCAAAA 
TG CTGGTATAAAAT C CAT AAACTT AAGGTTAGAG AATTC CAAAGAAAACTATAAT^ 
AAGT CTCAAGGATATAATAT CACTTGATTT CACT CGTT 

SEQ ID NO. 5608 
STRAIN 1169NT

aAGAAGCTTACTTTTATTTGGGATTTAGATGGG 

TAGAAGCT CTTGAAGAAAC CTATCGTCATTTTGGCTTAATATT^ 

ATGAATATATTTTACAGGAATCAGTGGGGAAATTATTGGTAAAC CTTT CAGAGGAAG AG C 
AAATACCTCATGAAAAACTGAAAGCATATTTTACAAAAGAACAAGAAAGT 
AAATACATTTAATG C CATACG CAAAAGAGATTTTAGAATGGACCAAAGAACAAGATATC C 
CCAATTTTATGTATACACAT AAAGGAG CAAGTACG CATT CAGTGTTGGAAACCTTG CAGA 
TCTCT CATTATTTTGATGAAATTTTAACTGGTGTTTCGGGATT CGAG CGAAAACCACAT C 
C^CAAGGGATTAATTATTTAGTTAAACGATATTCTTTAGAT 

TAGGAGATCGT C C C CTAGATTTGGAGGTTG CT CAAAATGCTGGTATAAAATCCATAAACT 

TAAGGTTAGAGAATTCCAAAGAAAACTATAATATTTCAAGT^ 

TTGATTTCACTCGTTTGGAT 

SEQ ID NO. 5609 
STRAIN JM9130013

AAGAAGCITACTTTTATTTGGGATTTAGATGGG 

TTCGTATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCAT^ 
TGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGGGAAATTATTGGTAAA 
CCTTTCAGAGGAAGAGCAAATACCTCAT^ 
AGAAAGTCGAGATTCTAAAATACATTTAATGCCATATGCA^ 

CAAAG AACAAGATAT C C C CAATTTTATGTATACACATAAAGGAGCAAGT ACG CATTCAGT 
GTTGGAAAC CTTG CAGATCT CT CATTATTTTGATGAAATTTTAACTGGTGTTTCGGGATT 
CGAG CGAAAAC CACATCCACAAGGG ATTAATTATTTAGTTAAACGATATT CTTT AGATAA 
AT CAATGACTTATTACATAGGAGAT CGTC CACTAGATT^ 

TATAAAAT CCATAAACITAAGGTT AGAGAATT C CAAAGAAAACT ATAATATTTCAAGTCT 
CAAAGATATAAT AT CACTTGATTT CACTCGT 

SEQ ID NO. 5610 

STRAIN 090

AAGAAGCTTACTTTTATTTGG 

GATTTAGATGGGACATTAATAGATTCGTATGTACCAATTATGGAAGCTCT 
TGAAGAAAC CTAT CGTCATTTTGG C1TAATATTTG ATAAAGAATT AATC C 
ATGAATATATTTT ACAGGAATCAGTGGGG CAATTATTGGTAAACCTTTCA 
GAGGAAGAGCAAATACCTCATGAAAAACTGAAAGCATATTTTACAAA^ 
ACAAGAAAGT CGAGATT CTAAAATACATTTAATGCCATATGCAAAAGAG A 
TTTTAGAATGGACCAAAGAACAAGATATCC CCAATTTTATGTATACACAT 
AAAGGAG CAAGTACG CATT CAGTGTTGGAAAC CTTGCAGATCTCT CATTA 
TTTTGATGAAATTTTAACTGGTGTTTCTGGATTCG AG CGAAAAC CACATC 
CACAAGGGATTAATTATTTAGTTAAACGATATTCITTAG 
ACITATTACATAGGAGATCGTCCCCTAG^TTTGGAGGTTGCTCAAAATGC 
TGGTATAAAATCCATAAACTTAAGGTTAGAGAATTCCAAAGAAAACTATA 
ATATTTCAAGTCTCAAGGATATAATATCACTTGATTTCACTCGT 

SEQ ID NO. 5611 
STRAIN M781 

AAGAAGCTT ACTITTTATTT GGG ATTTAGATGGGACATTAATAGATTCGT 
ATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTT 
ATATTTGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGG 
GCAATTATTGGTAAACCTTTCAG^GGAAGAGCAAATACCrrCATC 
TGAAAGCATATTTTACAAAAGAACAAGAAAGTCX^GATTyTAAAA 
TTAATGCCATATGCAAAAGAGATTTTAGAATGGACCAAAGAACAAGATAT 
T CCCAATTTTATGTATACACAT AAAGGAG CAAGTACG CATTCAGTGTTGG 
AAACCTTG CAGATCT CTCATTATTTTG AT G AAATTTT AACTGGTGTTTCG 
GGATTCGAG CGAAAACCACAT C CACAAGGGATTAATTATTTAGTTAAACG 
ATATT CTTTAGATAAAT CAATGACTTATT ACATAGGAGAT CGTC CACTAG 
ATTTGGAGGTTG CT CAAAATGCTGGTATAAAATC CAT AAACTTAAGGTTA 
GAGAATTC CAAAGAAAACTATAATATTT CAAGTCT CAAAGATATAATAT C 
ACTTGATTT CACTCGT 

PRETTY of: /biotmp/msa45163.2{*} January 21, 2003 06:53
msa45163.2{240_18RS21}
msa45163.2{240_2603}
msa45163.2{240_A909}
msa45163.2{240_H36B}
msa45163.2{240_JM9130013}
msa45163.2{240_COH1}
msa45163.2{240_M732}
msa45163.2{240_M781}
msa45163.2{240_090}
msa45163.2{240_CJB110}
msa45163.2{240_1169NT}

Consensus 



AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 
AAGAAGCTTA 

AAGAAGCTTA 
********** 



CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 

CTTTTATTTG 
********** 



GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 
GGATTTAGAT 



GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 
GGGACATTAA 



50 

TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 
TAGATTCGTA 



********** ********** ********** 



msa45163.2{240_18RS21}
msa45163.2{240_2603}
msa45163.2{240_A909}
msa45163.2{240_H36B}
msa45163.2{240_JM9130013}
msa45163.2{240_COH1}
msa45163.2{240_M732}
msa45163.2{240_M781}
msa45163.2{240_090}
msa45163.2{240_CJB110}
msa45163.2{240_1169NT}
Consensus



51 

TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 
TGTACCAATT 



ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATgGAAGCTC 
ATaGAAGCTC 



TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 
TTGAAGAAAC 



CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 
CTATCGTCAT 



100 

TTTGGtTTAA 
TTTGGtTTAA 
TTTGGtTTAA 
TTTGGtTTAA 
TTTGGtTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 
TTTGGcTTAA 



********** **_******* ********** ********** *****_**** 



msa45163.2{240_18RS21}
msa45163.2{240_2603}
msa45163.2{240_A909}
msa45163.2{240_H36B}
msa45163.2{240_JM9130013}
msa45163.2{240_COH1}
msa45163.2{240_M732}
msa45163.2{240_M781}
msa45163.2{240_090}
msa45163.2{240_CJB110}
msa45163.2{240_1169NT}
Consensus



101 

TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 
TATTTGATAA 



AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 
AGAATTAATC 



********** ********** 



CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 

CATGAATATA 
********** 



TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 

TTTTACAGGA 
********** 



150 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 

ATCAGTGGGG 
********** 



msa45163.2{240_18RS21}
msa45163.2{240_2603}
msa45163.2{240_A909}
msa45163.2{240_H36B}
msa45163.2{240_JM9130013}
msa45163.2{240_COH1}
msa45163.2{240_M732}
msa45163.2{240_M781}
msa45163.2{240_090}
msa45163.2{240_CJB110}
msa45163.2{240_1169NT}

Consensus 



                           151                                                200
msa45163.2{240_18RS21}     aAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_2603}       aAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_A909}       aAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_H36B}       aAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_JM9130013}  aAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_COH1}       cAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_M732}       cAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_M781}       cAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_090}        cAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_CJB110}     cAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
msa45163.2{240_1169NT}     aAATTATTGG TAAACCTTTC AGAGGAAGAG CAAATACCTC ATGAAAAACT
Consensus                  _********* ********** ********** ********** **********



                           201                                                250
msa45163.2{240_18RS21}     GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_2603}       GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_A909}       GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_H36B}       GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_JM9130013}  GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_COH1}       GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_M732}       GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_M781}       GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTyT AAAATACATT
msa45163.2{240_090}        GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_CJB110}     GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
msa45163.2{240_1169NT}     GAAAGCATAT TTTACAAAAG AACAAGAAAG TCGAGATTcT AAAATACATT
Consensus                  ********** ********** ********** ********_* **********

                           251                                                300
msa45163.2{240_18RS21}     TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_2603}       TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_A909}       TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_H36B}       TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_JM9130013}  TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_COH1}       TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATt
msa45163.2{240_M732}       TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATt
msa45163.2{240_M781}       TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATt
msa45163.2{240_090}        TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_CJB110}     TAATGCCATA tGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
msa45163.2{240_1169NT}     TAATGCCATA cGCAAAAGAG ATTTTAGAAT GGACCAAAGA ACAAGATATc
Consensus                  ********** _********* ********** ********** *********_
PCT/US2003/026827



Table 56: Comparative Sequences relating to SAG0806 



                           301                                                350
msa45163.2{240_18RS21}     CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_2603}       CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_A909}       CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_H36B}       CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_JM9130013}  CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_COH1}       CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_M732}       CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_M781}       CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_090}        CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_CJB110}     CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
msa45163.2{240_1169NT}     CCCAATTTTA TGTATACACA TAAAGGAGCA AGTACGCATT CAGTGTTGGA
Consensus                  ********** ********** ********** ********** **********



                           351                                                400
msa45163.2{240_18RS21}     AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_2603}       AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_A909}       AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_H36B}       AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_JM9130013}  AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_COH1}       AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_M732}       AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_M781}       AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
msa45163.2{240_090}        AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCtG
msa45163.2{240_CJB110}     AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCtG
msa45163.2{240_1169NT}     AACCTTGCAG ATCTCTCATT ATTTTGATGA AATTTTAACT GGTGTTTCgG
Consensus                  ********** ********** ********** ********** ********_*



                           401                                                450
msa45163.2{240_18RS21}     GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_2603}       GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_A909}       GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_H36B}       GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_JM9130013}  GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_COH1}       GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_M732}       GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_M781}       GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_090}        GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_CJB110}     GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
msa45163.2{240_1169NT}     GATTCGAGCG AAAACCACAT CCACAAGGGA TTAATTATTT AGTTAAACGA
Consensus                  ********** ********** ********** ********** **********



                           451                                                500
msa45163.2{240_18RS21}     TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_2603}       TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_A909}       TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_H36B}       TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_JM9130013}  TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_COH1}       TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_M732}       TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_M781}       TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCaCTAGA
msa45163.2{240_090}        TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCcCTAGA
msa45163.2{240_CJB110}     TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCcCTAGA
msa45163.2{240_1169NT}     TATTCTTTAG ATAAATCAAT GACTTATTAC ATAGGAGATC GTCCcCTAGA
Consensus                  ********** ********** ********** ********** ****_*****



                           501                                                550
msa45163.2{240_18RS21}     TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_2603}       TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_A909}       TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_H36B}       TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_JM9130013}  TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_COH1}       TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_M732}       TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_M781}       TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_090}        TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_CJB110}     TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
msa45163.2{240_1169NT}     TTTGGAGGTT GCTCAAAATG CTGGTATAAA ATCCATAAAC TTAAGGTTAG
Consensus                  ********** ********** ********** ********** **********



                           551                                                600
msa45163.2{240_18RS21}     AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_2603}       AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_A909}       AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_H36B}       AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_JM9130013}  AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_COH1}       AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_M732}       AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_M781}       AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAaGA TATAATATCA
msa45163.2{240_090}        AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAgGA TATAATATCA
msa45163.2{240_CJB110}     AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAgGA TATAATATCA
msa45163.2{240_1169NT}     AGAATTCCAA AGAAAACTAT AATATTTCAA GTCTCAAgGA TATAATATCA
Consensus                  ********** ********** ********** *******_** **********



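                           601                 621
msa45163.2{240_18RS21}     CTTGATTTCA CTCGTttgga t
msa45163.2{240_2603}       CTTGATTTCA CTCGTttgga t
msa45163.2{240_A909}       CTTGATTTCA CTCGT~~~~~ ~
msa45163.2{240_H36B}       CTTGATTTCA CTCGTttgga t
msa45163.2{240_JM9130013}  CTTGATTTCA CTCGT~~~~~ ~
msa45163.2{240_COH1}       CTTGATTTCA CTCGTttgga t
msa45163.2{240_M732}       CTTGATTTCA CTCGTttgga t
msa45163.2{240_M781}       CTTGATTTCA CTCGT~~~~~ ~
msa45163.2{240_090}        CTTGATTTCA CTCGT~~~~~ ~
msa45163.2{240_CJB110}     CTTGATTTCA CTCGT~~~~~ ~
msa45163.2{240_1169NT}     CTTGATTTCA CTCGTttgga t
Consensus                  ********** *****_____ _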



SEQ ID NO. 5612 
STRAIN 2603 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGKLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTRLD

SEQ ID NO. 5613 

STRAIN A909 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGKLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTR

SEQ ID NO. 5614 

STRAIN H36B frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGKLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTRLD

SEQ ID NO. 5615 

STRAIN 18RS21 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGKLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTRLD


SEQ ID NO. 5616 

STRAIN M732 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTRLD

SEQ ID NO. 5617 

STRAIN COH1 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTRLD

SEQ ID NO. 5618 

STRAIN CJB110 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTR

SEQ ID NO. 5619 

STRAIN 1169NT frame: 1 

KKLTFIWDLDGTLIDSYVPIIEALEETYRHFGLIFDKELIHEYILQESVGKLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTRLD

SEQ ID NO. 5620 

STRAIN JM9130013 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGKLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTR



SEQ ID NO. 5621 

STRAIN 090 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLLVNLSEEE
QIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTR



SEQ ID NO. 5622 
STRAIN M781 frame: 1 

KKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLLVNLSEEE
QIPHEKLKAYFTKEQESRDXKIHLMPYAKEILEWTKEQDIPNFMYTHKGASTHSVLETLQ
ISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYIGDRPLDLEVAQNAGIKSIN
LRLENSKENYNISSLKDIISLDFTR

PRETTY of: /biotmp/msa45645.2{*} January 21, 2003 06:57 .

                           1                                                   50
msa45645.2{240_18RS21}     KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_A909}       KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_JM9130013}  KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_2603}       KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_H36B}       KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_090}        KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_CJB110}     KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_M781}       KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_COH1}       KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_M732}       KKLTFIWDLD GTLIDSYVPI mEALEETYRH FGLIFDKELI HEYILQESVG
msa45645.2{240_1169NT}     KKLTFIWDLD GTLIDSYVPI iEALEETYRH FGLIFDKELI HEYILQESVG
Consensus                  ********** ********** _********* ********** **********

                           51                                                 100
msa45645.2{240_18RS21}     kLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_A909}       kLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_JM9130013}  kLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_2603}       kLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_H36B}       kLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_090}        qLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_CJB110}     qLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_M781}       qLLVNLSEEE QIPHEKLKAY FTKEQESRDX KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_COH1}       qLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_M732}       qLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
msa45645.2{240_1169NT}     kLLVNLSEEE QIPHEKLKAY FTKEQESRDs KIHLMPYAKE ILEWTKEQDI
Consensus                  _********* ********** *********_ ********** **********

                           101                                                150
msa45645.2{240_18RS21}     PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_A909}       PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_JM9130013}  PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_2603}       PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_H36B}       PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_090}        PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_CJB110}     PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_M781}       PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_COH1}       PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_M732}       PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
msa45645.2{240_1169NT}     PNFMYTHKGA STHSVLETLQ ISHYFDEILT GVSGFERKPH PQGINYLVKR
Consensus                  ********** ********** ********** ********** **********

                           151                                                200
msa45645.2{240_18RS21}     YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_A909}       YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_JM9130013}  YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_2603}       YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_H36B}       YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_090}        YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_CJB110}     YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_M781}       YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_COH1}       YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_M732}       YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
msa45645.2{240_1169NT}     YSLDKSMTYY IGDRPLDLEV AQNAGIKSIN LRLENSKENY NISSLKDIIS
Consensus                  ********** ********** ********** ********** **********

                           201 207
msa45645.2{240_18RS21}     LDFTRld
msa45645.2{240_A909}       LDFTR~~
msa45645.2{240_JM9130013}  LDFTR~~
msa45645.2{240_2603}       LDFTRld
msa45645.2{240_H36B}       LDFTRld
msa45645.2{240_090}        LDFTR~~
msa45645.2{240_CJB110}     LDFTR~~
msa45645.2{240_M781}       LDFTR~~
msa45645.2{240_COH1}       LDFTRld
msa45645.2{240_M732}       LDFTRld
msa45645.2{240_1169NT}     LDFTRld
Consensus                  *****__
SEQ ID NO: 5701
STRAIN 2603 

ATG CTTATG ACAAAAAT AATAGGACTGAGAGG AGGGATAG CTTCT 

GG AAAGT CAACGGTAAC AAAAATAAT ACG AGAAT C AGGTTTTAAAGT CAT AG AT G CG GAT 
CAAGTGGTT CAT AAATTG CAAG CT AAGGGTGGGAAACTTTAC CAAG CTTTATTAG AATGG 
TTGGGTC C CG AG AT ACTTG ATG CT GAT GGTGAGTTGG AT AGACCAAAGCTTT CT CAAATG 
ATTTTTG CT AAT C CAGACAATATG AAG ACAT C AG CTAGG CTACAAAATAGTAT CATT CGT 
CAAGAGTTAG CATGT CAG CGCG AC CAATTAAAACAAACAGAAGAGAT ATTTTT CATGGAT 
ATTC CTTT ATTGATTG AAGAAAAGT ATATAAAATGGTTTG AT GAGATTTG GTTGGTATTT 
GTTGATAAAGAAAAACAATTAGAACGATTAATGGCCCGTAAC 

G CAGAATTACGACTTTCACAC CAAATG CCTTTAAGAG AT AAAAAAAGTTT CG CTAGT CTT 

ATTATTGACAATAATC4GTGATTTAATAACTTTAAA^ 

CGTTTA 

SEQ ID NO: 5702 
STRAIN 090 

AAGTCAACGGTAACAAAAATAATACGAGAATCAG 

GTTTTAAAGTCATAG ATG CGGATCAAGTGGTT CAT AAATTGCAAG CTAAG 
GGTGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCCGAGATACT 
TGATG CTGATGGTG AGTTGG AT AG ACCAAAG CTTT CT CAAATGATTTTTG 
CT AAT CCAGACAATATGAAGACAT CAG CTAGG CTACAAAATAGTATCATT 
CGTCAAGAGTTAGCATGTC!AGCGCGAlCCAATTAAAACAAACAGAAGAGAT 

ATTTTTCGTGGATATTCCTTTATTGATTGAA 
TTGATGAGATTTGGTTGGTATTTCTTGATAAA 

TTAATGG CC CGTAACAACTACAGTCGAGAAGAAG CAGAATTACG ACTTT C 
ACAC CA^TG CCTTTAACAGAT AAAAAAAGTTTCG CTAGT CTTATTATTA 
AT AATAATGGTG ATTTAATAACTTTAAAAG AGCAAAT ATTGGAT G CT CTT 

CAACGTTTA 

SEQ ID NO: 5703 
STRAIN A909 

AAGT CAACGGTAACAAAAATAATACGAGAATCAG 

GTTTTAAAGT CATAGATGCGGAT CAAGTGGTT CATAAATTG CAAG CTAAG 
GGTGGGAAACTTTACCAAGCTTTATTAGAATG GTTGGGTC CCGAGATACT 
TGATG CTGATGGTGAGTTGGAT AGACCAAAG CTTT CT CAAATGATTTTTG 
CTAATCCAGACAATATGAAGACATCAGCTAGG CTACAAAATAGTATCATT 
CGTCAAGAGTTAG CATGTCAG CGCGACCAATT AAA&CAAACAGAAGAGAT 
ATTTTTCATGGATATTCCTTTATTGATTGAAGAAAAGTATATAAAATGGT 
TTa^TGAGATTTGGTTGGTATTTGTTGATAAAGA7\AAACAATTACAACGA 
TTAATGG C CCGTaACAACTACAGTCGAGAAGAAG CAGAATTACGACTTT C 
ACACCAAATGCCITTAACAGATAAAAAAAGTTTCGCTAGTCTTATTATTG 
ACAATAATGGTGATTTAAT AACTTT AAAAGAG CAAATATTGG ATG CT CTT 
CAACGTTTA 

SEQ ID NO: 5704 
STRAIN H36B 

AAGT CAACGGTAACAAAAAT AATACGAGAATCAGG 

TTTT AAAGT CATAGATGCGGAT CAAGTGGTTCAT AAATTG CAAG CTAAGG 
GTGGGAAACTTTAC GAAGCTTT ATTAGAATGGTTGGGTC C CGAGATACTT 
GATG CTGATGGTGAGTTGGATAGAC CAAAG CTTT CTCAAATGAT1TTTG C 
TAATCCAGACAATATGAAGACATCAGCTAGGCTACAAAATAGTATCATTC 
GTCAAGAGT TAG CATGTCAG CG CGACCAATTAAAACAAACAGAAGAGAT A 
TTTTTCATGGATATT CCTTT ATTGrATTGAAGAAAAGTAT AT 
TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAA 
TAATGGC CCG t^AACAACTACAGT CGAGAAGAAGCGGAATTACGACTTT CA 
CACCAAATAC CTTT AACAG ATAAAAAAAGTTT CG CTAGT CTTATTATTGA 
TAATAATGGTGATTTAATTU^CTTTAAAAGAGCAAATGTTGGATGCTCTTC 

AACGTTTA 

SEQ ID NO: 5705 
STRAIN 18RS21

AAGT CAACGGTAACAAAAAT AATACGAGAAT CAGG 

TTTTAAAGT CATAG ATG CGGAT CAAGTGGTTCATAAATTG CAAG CTAAGG 
GTGGGAAACITTACCAAGCTTTATTAGAATGGTTGGGTCCCGAGATA 
GATG CTGATGGTGAGTTGGATAGAC CAAAG CITTCT CAAATGATTTTTGC 
TAATC CAGACAATATGAAGACAT CAGCTAGGCTACAAAATAGTAT CATTC 
GT CAAGAGTTAG CATGT CAG CG CGACCAATTAAAACAAACAG AAGAGATA 
TTTTT CATGGATATTC CITTATTGATTGAAGAA 
TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAACT 
TAATGGC CCGTAACAACTACAGT CGAGAAGAAG CAGAATTACGACTTT CA 
CACCAAATGCCTTTAACAGATAAAAAAAGTTTCX3OT 

CAAT AATGGTGATTTAATAACTTT AAAAGAG CAAATATTGGATG CTCTT C 
AACGTTTA 

SEQ ID NO: 5706 
STRAIN M732 

AAGT CAACGGTAACAAAAATAATACGAGAAT CAGGTT 

ttaaagtcatagatgcggatcaagtggttcataaattgcaagctaagggt 
gggaaactttaccaagctttattagaatggttgggtcccgagatact 
tgctgatggtgagttggatagaccaaagctttctcaaatgatttttgct 
atccac^caatatgaagacatcagctaggctacaaaatagtatcattcgt 
caagagttag catgt cag cg cg ac caatt aaaacaaacagaagagatatt 
TTTCATGGATATTCCTTTATTGATTGAAGAAAAGTATATAAAATGGTTTG
ATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGATTA 
ATGGCCCGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACTTTCACA 
CCAAATGCCTTTAACAGATAAAAAAAGTTTCGCTAGTCTTATTATTGACA 
ATAATGGTG ATTT AATAACTTT AAAAG AG CAAAT ATTGGATG CT CTT CAA 
CGTTTA 

SEQ ID NO: 5707 
STRAIN COH1 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGGT 

TTTAAAGTCATAGATGCGGATCAAGTGGTTCATAAATTGCAAGCTAAGGG 
TGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCCGAGATACTTG 
ATGCTGATGGTGAGTTGGATAGACCAAAGCTTTCTCAAATGATTTTTGCT 
AAT C CAG ACAAT ATG AAGACAT CAG CTAGG CTACAAAATAGT ATCATT CG 
TCAAGAGTTAGCATGTCAGCGCGACCAATTAAAACAAACAGAAGAGATAT 
TTTT CATGGATATT C CTTT ATT GATTGAAGAAAAGTATATAAAATGGTTT 
GATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGATT 
AATGG CC CGTaACAACTACAGT CGAGAAG AAG CAGAA.TTACG ACTTT CAC 
ACCAAATGCCTTTAACAGAT AAAAAAAGTTT CG CTAGTCTTATTATTGAC 
AAT AATGGTGATTT AAT AACTTTAAAAGAG CAAATATTGGATGCT CTT CA 
ACGTTTA 

SEQ ID NO: 5708 
STRAIN M781 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGG 

'mTAMGTCATAGATGCGGATCMGTGGTTCATAMTTGCAAGCTAAGG 
GTGGGAAACTTTACCAAGCTTTATTAGAATGGTTGGGTCCaSAGATACTT 
GATG CTGATGGTGAGTTGGATAGAC CAAAG CTCT CTCAAATGATTTTTG C 
TAAT CCAGACAATATGAAGACATCAG CTAGGCTACAAAATAGTATCATTC 
GT CAAGAGTTAG CATGT CAGCG CG ACCAATTAAAACAAACAGAAG AGATA 
TTTTT CATGGATATT CCTTTATTGATTGAAGAAAAGTATATAAAATGGTT 
TGATGAC4ATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGAT 
TAATGGCCCGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACTTTCA 
CAC CAAATG C CTTT AACAGATAAAAAAAGTTT CG CTAGTCTTATTATTGA 
CAATAATGGTGATTTAATAACTTTAAAAG AG CAAATATTGGATG CT CTT C 
AACGTTTA 

SEQ ID NO: 5709 
STRAIN CJB110

AAGTCAACGGTAACAAAAATAATACGAGAA 

T CAGGTTTTAAAGTCAT AGATGCGGAT CAAGTGGTTCATAA^TTGCAAG C 
TAAGGGTGGG AAACTTTAC CAAG CTTT ATTAGAATGGTTGGGTC C CGAGA 
TACTTGATG CTGAT GGTGAGTTGG ATAGACCAAAG CTTT CT CAAATG ATT 
TTTG CTAAT CCAGACAATATGAAGACATCAGCT AGGCTACAAAAT AGTAT 
CATT CGT CAAGAGTTAG CATGT CAG CG CG AC CAATTAAAAQ\AACAG AAG 
AGATATTTTTCGTGGATATTCCTTTATTGATTGAAGAAAAGTATATAAAA 
TGGTTTGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAAC^ 
ACGACTAATGGCCCGTaACAACTACAGTCGAGAAGAAGCAGAATTACGAC 
TTT CACACCAAATG CCTTTAACAGATAAAAAAAGTTTCGCTAGT CTTATT 
ACTAATAATAATGGTGATTTAATAACTCTAAAA.GAGCAAATATTGGATGC 

TCTTCAACX3TTTA 

SEQ ID NO: 5710 
STRAIN 1169NT 

AAGTCAACGGTAACAAAAATAATACGAGAATCAGG 

TTTTAAAGT CATAGATG CGGAT CAAGTGGTTCATAAATTG CAAGCTAAGG 
GTGGGAAACTCTACCAAGCTCTACTAGAATGGCTGGGTCCCGAGATACTT 
GATGCTGATGGTGAGCTGGATAGACCAAAGCTCTCTCAAATGATTTTTGC 
TAAT C CAGACAATATGAAG ACATCAGCTAGG CTACAAAAT AGTATCATTC 
GT CAAGAGTTAG CATGT CAG CG CGACCAATTAAAACAAACAGAAG AGATA 
TTTTT CATGGATACTCCCTT ATTG ATTGAAGAAAAGTATATAAAATG GTT 
TGATGAGATTTGGTTGGTATTTGTTGATAAAGAAAAACAATTACAACGAT 
TAATGGCCCGTAACAACTACAGTCGAGAAGAAGCAGAATTACGACTTTCA 
CACCAAATAC CTTTAACAGATAAAAAAAGTTT CG CTAGT CTTATTATTG A 
TAATAATGGTGATTTAATAACTTT AAAAG AG CAAATGTTGGATGCT CTT C 
AACGTTTA 

SEQ ID NO: 5711 
STRAIN JM9130013 

AAGT CAACGGTAACAAAAAT AATACGAGAATCAGGT 

TCTAAAGTCATAGATGCGGATCAAGTGGCTCATAAATTG CAAG CTAAGGG 
TGGGAAACTTTACCAAG CTTTATTAGAATGGTTGGGT C CCGAGATACTTG 
ATGCTGATGGTGAGTTGGATAGACCAAAGCTCTCTCAAATGAT1TCTGCT 
AAT CCAG ACAAT ATGAAGACAT CAG CTAGG CTACAAAATAGT AT CATTCG 
TCAAG AGTTAGCATGT CAG CG CGAC CAATT AAAACAAACAGAAGAGATAT 
TTTT CATGGATATT C CTTTACTGATTG AAGAAAAGTATAT AAA^TGGTTT 
GATGAGATTTGGCTGGTATCTGCTGATAAAGAAAAACAACTAa^CGATT 
AATGG C C CGTAACAACTACAGT CGAGAAG AAG CGGAACTACGACTTT CAC 
AC CAAATAC CTTTAACAGATAAAAAAAGTTTCG CT AGTCTTATTATTGAT 
AAT AATGGTGATTTAAT AACTTTAAAAGAG CAAATGTTGGATG CT CTT CA 
ACGTTTA 
PRETTY of: /biotmp/msa221059.2{*} February 10, 2003 07:07

                            1                                                   50
msa221059.2{245_H36B}       ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_JM9130013}  ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_1169NT}     ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_090}        ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_CJB110}     ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_18RS21}     ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_2603}       atgcttatga caaaaataat aggactgaca ggagggatag cttctggaAA
msa221059.2{245_A909}       ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_COH1}       ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_M732}       ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
msa221059.2{245_M781}       ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~AA
Consensus                   ********** ********** ********** ********** **********

                            51                                                 100
msa221059.2{245_H36B}       GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_JM9130013}  GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_1169NT}     GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_090}        GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_CJB110}     GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_18RS21}     GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_2603}       GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_A909}       GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_COH1}       GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_M732}       GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
msa221059.2{245_M781}       GTCAACGGTA ACAAAAATAA TACGAGAATC AGGTTTTAAA GTCATAGATG
Consensus                   ********** ********** ********** ********** **********

                            101                                                150
msa221059.2{245_H36B}       CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_JM9130013}  CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_1169NT}     CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_090}        CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_CJB110}     CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_18RS21}     CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_2603}       CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_A909}       CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_COH1}       CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_M732}       CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
msa221059.2{245_M781}       CGGATCAAGT GGTTCATAAA TTGCAAGCTA AGGGTGGGAA ACTTTACCAA
Consensus                   ********** ********** ********** ********** **********

                            151                                                200
msa221059.2{245_H36B}       GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_JM9130013}  GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_1169NT}     GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_090}        GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_CJB110}     GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_18RS21}     GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_2603}       GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_A909}       GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_COH1}       GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_M732}       GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
msa221059.2{245_M781}       GCTTTATTAG AATGGTTGGG TCCCGAGATA CTTGATGCTG ATGGTGAGTT
Consensus                   ********** ********** ********** ********** **********

                            201                                                250
msa221059.2{245_H36B}       GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_JM9130013}  GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_1169NT}     GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_090}        GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_CJB110}     GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_18RS21}     GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_2603}       GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_A909}       GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_COH1}       GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_M732}       GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
msa221059.2{245_M781}       GGATAGACCA AAGCTTTCTC AAATGATTTT TGCTAATCCA GACAATATGA
Consensus                   ********** ********** ********** ********** **********



                            251                                                300
msa221059.2{245_H36B}       AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_JM9130013}  AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_1169NT}     AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_090}        AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_CJB110}     AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_18RS21}     AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_2603}       AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_A909}       AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_COH1}       AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_M732}       AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
msa221059.2{245_M781}       AGACATCAGC TAGGCTACAA AATAGTATCA TTCGTCAAGA GTTAGCATGT
Consensus                   ********** ********** ********** ********** **********



msa221059.2{245_H36B}
msa221059.2{245_JM9130013}
msa221059.2{245_1169NT}
msa221059.2{245_090}
msa221059.2{245_CJB110}
msa221059.2{245_18RS21}
msa221059.2{245_2603}
msa221059.2{245_A909}
msa221059.2{245_COH1}
msa221059.2{245_M732}
msa221059.2{245_M781}
Consensus



msa221059.2{245_H36B}
msa221059.2{245_JM9130013}
msa221059.2{245_1169NT}
msa221059.2{245_090}
msa221059.2{245_CJB110}
msa221059.2{245_18RS21}
msa221059.2{245_2603}
msa221059.2{245_A909}
msa221059.2{245_COH1}
msa221059.2{245_M732}
msa221059.2{245_M781}
Consensus



msa221059.2{245_H36B}
msa221059.2{245_JM9130013}
msa221059.2{245_1169NT}
msa221059.2{245_090}
msa221059.2{245_CJB110}
msa221059.2{245_18RS21}
msa221059.2{245_2603}
msa221059.2{245_A909}
msa221059.2{245_COH1}
msa221059.2{245_M732}
msa221059.2{245_M781}
Consensus



msa221059 . 
msa221059.2{245 
msa221059.2{ 
msa221059 
msa221059 
msa221059 

msa221059 . 

msa221059 . 

msa2210S9. 

msa221059 . 

msa221059. 



2{24S_H36B 
_JM9130013 
245_1169NT 
.2{245_090} 
.2(245_CJB110> 
.2{245_18RS21} 
2 {245^2603} 
2{245_A909} 
2{245__COHl} 
2{245_M732} 
2{24S_M781} 
Consensus 



301 

CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 
CAGCGCGACC 



msa221059 . 
msa221059.2{245 
msa221059.2{ 
msa221059 
msa221059.2{ 
msa221059.2{ 
msa221059. 
msa221059 . 
msa221059. 



2{245_H36B} 
_JM9130013} 
245_1169NT; 
.2{245_090 
245_CJB110 
245_18RS21 
2{245_2603 
2{245_A909 
2{245_COHl 



AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 
AATTAAAACA 



AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 
AACAGAAGAG 



ATATTTTTCa 
ATATTTTTCa 
ATATTTTTCa 
ATATTTTTCg 
ATATTTTTCg 
ATATTTTTCa 
ATATTTTTCa 
ATATTTTTCa 
ATATTTTTCa 
ATATTTTTCa 
ATATTTTTCa 



350 

TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 
TGGATATTCC 



********** ********** 



********** *********- ********** 



maa22 1059 .2(24 5_H36B} 
msa221059 . 2 { 245_JM9130 013 } 
msa221059.2{245_1169NT^ 
msa221059.2{245_090 
msa221059.2{245_CJB110 
msa221059.2{245_18RS2lJ 
msa221059 . 2 { 245_2603 } 
msa221059 . 2 { 245_A909} 
msa221059 . 2{245_COHl} 
msa221059.2(245_M732) 
tnsa221059.2{.245_M78l} 

Consensus 



GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
GAAGAAAAGT 
********** 



351 

TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
TTTATTGATT 
********** 

401 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 

TATTTGTTGA 
********** ********** 



ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
ATATAAAATG 
********** 



GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
GTTTGATGAG 
********** 



TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 
TAAAGAAAAA 



400 

ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 
ATTTGGTTGG 

ATTTGGTTGG 
********** 

450 

CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
CCGTAACAAC 
********** ********** ********** 



CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 
CAATTACAAC 



GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 
GATTAATGGC 



451 

TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
TACAGTCGAG 
********** 

501 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 

AGATAAAAAA 
********** 

551 

TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 
TAACTTTAAA 



AAGAAGCgGA 
AAGAAGCgGA 
AAGAAG CaGA 
AAGAAGCaGA 
AAGAAG CaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 
AAGAAGCaGA 

AAGAAGCaGA 
*******_** 



ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 
ATTACGACTT 



TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 
TCACACCAAA 



500 

TaCCTTTAAC 
TaCCTTTAAC 
TaCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 
TgCCTTTAAC 



********** ********** *_******** 



AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 
AGTTTCGCTA 

AGTTTCGCTA 
********** 



GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 

GTCTTATTAT 
********** 



TgAtAATAAT 
TgAtAATAAT 
TgAtAATAAT 
TaAtAATAAT 
TaAtAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 
TgAcAATAAT 



550 

GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 
GGTGATTTAA 



AGAGCAAATg 
AGAGCAAATg 
AGAGCAAATg 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 
AGAGCAAATa 



TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 
TTGGATGCTC 



*_*_****** ********** 
591 

TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 
TTCAACGTTT A 






WO 2004/018646 



Table 57: Comparative Sequences relating to SAG1488



msa221059.2{245_M732}       TAACTTTAAA AGAGCAAATa TTGGATGCTC TTCAACGTTT A
msa221059.2{245_M781}       TAACTTTAAA AGAGCAAATa TTGGATGCTC TTCAACGTTT A

Consensus                   ********** *********- ********** ********** *

SEQ ID NO: 5712 
STRAIN 2603 frame: 1 

MLMTKIIGLTGGIASGKSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEI
LDADGELDRPKLSQMIFANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLI
EEKYIKWFDEIWLVFVDKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNN
GDLITLKEQILDALQRL

SEQ ID NO: 5713 
STRAIN 090 frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFVDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIINNNGDLITLKEQILDALQR
L

SEQ ID NO: 5714 
STRAIN A909 frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNNGDLITLKEQILDALQR
L

SEQ ID NO: 5715 
STRAIN H36B frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQIPLTDKKSFASLIIDNNGDLITLKEQMLDALQR
L

SEQ ID NO: 5716 
STRAIN 18RS21 frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNNGDLITLKEQILDALQR
L

SEQ ID NO: 5717 
STRAIN M732 frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNNGDLITLKEQILDALQR
L

SEQ ID NO: 5718 
STRAIN COH1 frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNNGDLITLKEQILDALQR
L

SEQ ID NO: 5719 
STRAIN M781 frame: 1 

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIIDNNGDLITLKEQILDALQR
L

SEQ ID NO: 5720 
STRAIN CJB110 frame: 1

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFVDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQMPLTDKKSFASLIINNNGDLITLKEQILDALQR
L

SEQ ID NO: 5721
STRAIN 1169NT frame: 1

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQIPLTDKKSFASLIIDNNGDLITLKEQMLDALQR
L

SEQ ID NO: 5722 
STRAIN JM9130013 frame: 1

KSTVTKIIRESGFKVIDADQVVHKLQAKGGKLYQALLEWLGPEILDADGELDRPKLSQMI
FANPDNMKTSARLQNSIIRQELACQRDQLKQTEEIFFMDIPLLIEEKYIKWFDEIWLVFV
DKEKQLQRLMARNNYSREEAELRLSHQIPLTDKKSFASLIIDNNGDLITLKEQMLDALQR
L



PRETTY of: /biotmp/msa221398.2{*} February 10, 2003 07:15



                            1                                                   50
msa221398.2{245_090}                         KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_CJB110}                      KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_1169NT}                      KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_H36B}                        KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_JM9130013}                   KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_18RS21}                      KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_2603}       mlmtkiiglt ggiasgKSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_A909}                        KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_COH1}                        KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_M732}                        KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
msa221398.2{245_M781}                        KSTV TKIIRESGFK VIDADQVVHK LQAKGGKLYQ
Consensus                   ********** ********** ********** ********** **********

                            51                                                 100
msa221398.2{245_090}        ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_CJB110}     ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_1169NT}     ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_H36B}       ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_JM9130013}  ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_18RS21}     ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_2603}       ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_A909}       ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_COH1}       ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_M732}       ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
msa221398.2{245_M781}       ALLEWLGPEI LDADGELDRP KLSQMIFANP DNMKTSARLQ NSIIRQELAC
Consensus                   ********** ********** ********** ********** **********

                            101                                                150
msa221398.2{245_090}        QRDQLKQTEE IFFvDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_CJB110}     QRDQLKQTEE IFFvDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_1169NT}     QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_H36B}       QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_JM9130013}  QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_18RS21}     QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_2603}       QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_A909}       QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_COH1}       QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_M732}       QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
msa221398.2{245_M781}       QRDQLKQTEE IFFmDIPLLI EEKYIKWFDE IWLVFVDKEK QLQRLMARNN
Consensus                   ********** ***-****** ********** ********** **********

                            151                                             197
msa221398.2{245_090}        YSREEAELRL SHQmPLTDKK SFASLIInNN GDLITLKEQi LDALQRL
msa221398.2{245_CJB110}     YSREEAELRL SHQmPLTDKK SFASLIInNN GDLITLKEQi LDALQRL
msa221398.2{245_1169NT}     YSREEAELRL SHQiPLTDKK SFASLIIdNN GDLITLKEQm LDALQRL
msa221398.2{245_H36B}       YSREEAELRL SHQiPLTDKK SFASLIIdNN GDLITLKEQm LDALQRL
msa221398.2{245_JM9130013}  YSREEAELRL SHQiPLTDKK SFASLIIdNN GDLITLKEQm LDALQRL
msa221398.2{245_18RS21}     YSREEAELRL SHQmPLTDKK SFASLIIdNN GDLITLKEQi LDALQRL
msa221398.2{245_2603}       YSREEAELRL SHQmPLTDKK SFASLIIdNN GDLITLKEQi LDALQRL
msa221398.2{245_A909}       YSREEAELRL SHQmPLTDKK SFASLIIdNN GDLITLKEQi LDALQRL
msa221398.2{245_COH1}       YSREEAELRL SHQmPLTDKK SFASLIIdNN GDLITLKEQi LDALQRL
msa221398.2{245_M732}       YSREEAELRL SHQmPLTDKK SFASLIIdNN GDLITLKEQi LDALQRL
msa221398.2{245_M781}       YSREEAELRL SHQmPLTDKK SFASLIIdNN GDLITLKEQi LDALQRL
Consensus                   ********** ***-****** *******-** *********- *******



Table 58: Comparative Sequences relating to SAG0182



SEQ ID NO. 5801
STRAIN 2603 

ATGTTGATGGTGTTGTTAT T C CAAAGG CT AGG AATTATT ATGATTTTAG C CTTTTTATT G 
GTAAATAATAGTTATTTTAG ACAGTTAATTGAAG AG CGGT CTAAACGTG AAACGGTAGT C 
CTTGTCAT CATTTT CGG CTTGTTTGTT ATT AT ATCTAAT ATAACAGG AATTG AAATAAAA 
GGGGATCGAAGTTTGGTCGAGCGCCCTTTTCTAACAACGATTTCrrCATTCTGACT 
GCTAATACAAGGACTTTAGTTATTACAACGGG^GTTTGGTTGGTGGACCrCTGGTTGGA 
TC^TTGTTGGTTTTATTGGAGGAGTTCATCGCrTTTTT 

TTCTATATTGTCAGTTCAGTT CT AGT CGG CATTGTTAG CGGAAAGATTGGTG AT AAG CTT 
AAGGAAAACCAT CT CTACC CTT CAACAAG CCAAGTTATTTTAATTAGTATTATTGCCGAA 
AGTAT CCAGATG CTATTTGTTGGCATTTTTACAGGATGGGAACTTGTCLAAAATGATTGT C 
ATTCCAATGATGATTTTAAATAGTTTAGGTTCCACACTTTTCCT 

TATTTGTCAAATGAAAGTCAGTTACGCGCAGTTCAAACGAGAGATGTTCTTGAATTGACT 
CGACAGACTCTGCCCTACCTTAGACAAGGTTTGACACCGGAATCTGCrAGGAGCGT^ 
GAAATTATAAAGAG G CATACTAACTTTG ATG CTGTGGGAT TAACAGATCGGT CAAACGTA 
TTAG CTCATATT GGTGTTGG CCAT GATCAC CATATTG CAGGACAACCGGT CAAAACAGAC 
TT AT CTAAAAGTGTT ATTTTTGATGG CG AAC CAAGAATTG CG CAAGATAAAGCGGCGATT 
TCTTGTC CAG AT CACAACTGTCAGTTAAATTCTG CTATTGTAGTTCut u i. AAAAATAAAT 
GATAAAACT GTGGGTG C CTTTAAAAATGTACTTTG CAGGAGATAAGACAATGT CTGAGGTG 
GAGGAAAACCTAGT CCTTGGTTTAG CG CAAATATTTT CAGGACAACTGG CAATG GGGATA 
ACAGAGGAACAAAATAAGTT AG CCAGT ATGG CAG AGATAAAGGCTTTACAAG CACAAATC 
AACC CTCATT" J.' CTT Cl'iT'AATG CCATTAACACAATTAGTG CATTAAT CCGTATTGATTCT 
GATAAAG CACGTTATG CACTGATG CAGTTAAGTACUTTTT'lU AGAACAAGTTTG CAGGGT 
GGTCAGGAT CGTGAGGTAAC G CTTG AG CAAGAAAAATCACATGTG GATG CTTATATGAAT 
GTTGAAAAATTACGTTT CC CTG AT AAAT ATCAGTTATCTTATGA 
AAAATGAAGTTACCACCTTTTGGTTTACAGGTACTGGTAGAGAATGC^ 
TT CAAAGAACGTAAGACGGACAAC CAT ATATTGGTT CAAATAAAG CCAGATGGT CATTAT 
TATTGTGTTT CTGTTAGTGACAATGGACAAGGAATCTCAGATACTAT CATTGATAAATTA 
GGTCAAGAAACAGTTG CAGAGAGTAAGGGT ACAGGTACTGCT 
AGGCTGAATTTATTATATGGTAGTGTAAGTTGCCTTCATTT^ 

ACAAAAGTTTGGTATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAAT 
TCT 



SEQ ID NO. 5802 
STRAIN 090 

TTGATGGTGTTGTTATT CCAAAGG CTAGGAATTATTAT 
GATTTTAGCCTTTTTATTGGTAAATAATAGTTATTTC^ 
AAGAGCGGTCTAAACGTGAAACGGTAGTACTTGTCATCATTTTCGGCTTG 
TTTGTTATT ATATCTAATATAACA.GGAATTGAAAT AAAAGGGGAT CG AAG 
TTTGGTCGAGCGCCCITTTCTAACAACGATTTCC 

CrAATACAAGGACTTTAGTTATTACAACGG CAAGTTTGGTT^ CT 
CTGGTTGGAT CAATTGTTGGTTTTATTGGAGGAGTTCAT CGCTT^ 
AGGAAGCTTTTCAGGTT CTTT CTATATTGT CAGTTCAGTT CTAGT CGG CA 
TTGTTAGCGGAAAGATTGGTGATAAGCTTAAGGAAAACCATCrCTACCCT 
TCAACAAG C CAAGTTATTTTAATTAGTATT ATTG CCGAAAGTATCCAGAT 
GCT'ATTTGTTGGTATTTTTACAGGATGGG AACTTGT (2AAAATGATTGTCA 
TT CCAATGATGATTTTAAAT AGTTTAGGTT C CACACTTTT CCTTGCGATT 
TTGAAAACTTATTTGTCAAATGAAAGT C^GTT ACGCG CAGTT CAAACGAG 
AGATGTTCI^GAATTGACT CGACAGACT CTGCCCTAC CT CAGACAAGGTT 
T<^CACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACT 
AACTTTGATG CTGTAGGATTAACAGATCGGT CAAACGTATTAGCT CATAT 
TGGTGTTGG CCATGATCAC CAT ATTGCAGGACAACCAGT CAAAACAGAC C 
TATCTAAAAGTGTTATTTTTGATGGCGAAC CAAGAATTG CGCAAGAT AAA 
GCGGCGATTTCTTGT C CAGAT CACAACTGTCAGTTAAATT CTGCTATTGT 
AGTTC CT CT AAAAATAAATGATAAAACTGTGGGTG CCTTAAAAATGTACT 
TTGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAAC CTAGTC CTTGGT 
TTAGCGCAAATATTTT*CAGGACAACTC^CAATGGGG^ 
AAATAAGTTAGC CAGTATGG CAGAGATAAAGG CTTTACAAGCACAAATCA 
AC CCTCA'ITT' CTT' CTTTAATGC CATTAACACAATTAGTG CATTAATCCGT 
ATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTAC M riT''lT"i' 
TAGAACAAGTTTGCAAGGTGGT(^G4GATCGTGAGGTAACGCTTGAGCAAG 
AAAAATCACATGTGGATGCTTATATGAATGTTCAAAAATTACGTTTCCCT 
GATAAATAT CAGTTATCTTATGATATTAGTG CAC CAGAAAAAATGAAGTT 
AC CG CCTTTTGGTTTACAGGTACT'GGT AG AGAATGCAGTTAGAC^ CTT 
TCAAAGAACGTAAGACGGAGAAC CATAT ATTGGTT CAAATAAAG CCAGAT 
GGTCATTATTATTGTGTTTCTGTTAGTGACAATGG ACAAGGAAT CT CAGA 
TACTATCATTGATAAATTAGGTCAAGA^CAGTTGCAGAGAGTAAGGGTA 
CAGGTACTGCTCTAGTTAATCrAAAT^ 

AGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGT^ 
GTAT CGAATACCTAATAGAATAAGGGAGGATGAG CATGAAAATTTTAATT 
CT 



SEQ ID NO. 5803 
STRAIN A909 

TTGATGGTGTTGTTATT CCAAAGG CTAGGAATTATTAT 

GATTTTAGCCTTTTTATTGGTAAATAATAGTTATTTCAGACAGTTAATTG 

AAGAG CGGTCTAAACGTGAAACGGT AGT C CTTGT CAT CATTTTCGG CTTG 

TTTGTTATTATATCrrAATATAACAGGAATTGAAATAAAAGGGGATCGAAG 

TTTGGTCGAGCGCCCTTTTCTAACAACGATTTCTCAT^ 

CTAATACAAGGACTTTAGTTATTACAACGGCAAGTTTG^ 

CTGGTTGGATCAATTGTTGGTTTTATT 

AGGAAGCTTTTCAGGTT CTTTCTATATTGT CAGTT CAGTT CTAGT CGG CA 
TTGTTAGCGGAAAGATTGGTGATAAGCTTA^GGAAAACCATCTCTACCCr 
TCAACAAGCCAAGTTATTTTAATTAGTATTATTGCCGAAAGTATCCAGAT
GCTATTTGTTGGCATTTTTACAGGATGGGAACTTGTCAAAATGATTGTCA 
TTCCAATGATGATTTTAAATAGTTTAGGTTCCACACTTTTCCTTGCGATT 
TTGAAAACTTATTTGTCAAATGAAAGTCAGTTACGCGCAGTTCAAACGAG 
AGATGTTCTTGAATTGACTCGACAGACTCTGCCCTACCTTAGACAAGGTT 
TGACACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACT 
AACTTTGATGCTGTGGGATTAACAGATCGGTCAAACGTATTAGCTCATAT 
TGGTGTTGGCCATGATCACCATATTGCAGGACAACCGGTCAAAACAGACT 
TATCTAAAAGTGTTATTTTTGATGGCGAACCAAGAATTGCGCAAGATAAA 
GCGGCGATTTCTTGTCCAGATCACAA.CTGTCAGTTAAATTCTGCTATTGT 
AGTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACT 
TTGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTTGGT 
TTAG CG CAAATATTTT CAGGACAACTGG CAATGG G GATAACAGAGGAACA 
AAAT AAGTTAGC CAGTATGG CAGAGATAAAGGCTTTACAAG CACAAATCA 
AC C CT CATTT CTT CTTTAATGC CATTAACACAAT TAGTG CATTAAT C CGT 
ATTGATT CTGATAAAG CACGTT ATG CACTGATG CAGTTAAGT ACTTTTTT 
TAGAACAAGTTTGCAGGGTGGT CAGGATCGTGAGGTAACG CTTG AGCAAG 
AAAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCT 
GATAAATATCAGTTATCTTATGATATT AGTG CAC CAG AAAAAATGAAGTT 
ACCACCTTTTGGTTTACAGGTACTGGTAGAGAATGCAGTTCGACATGCTT 
T CAAAGAAC GTAAG ACGGACAACCATATATTGGTT CAAAT AAAG C CAGAT 
GGTCATTATTATTGTGTTT CTGTTAGTGACAATGGACAAGGAAT CT CAG A 
TACTATCATTGATAAATTAGGTCAAGAAACAGTTGCAGAGAGTAAGGGTA 
CAGGT ACTG CTCTAGTTA^T CTAAATAACAGG CTGAATTTATTAT ATGGT 
AGTGTAAGTTGCCTT CATTTTT CGAGCGACAAGAATGGTACAAAAGTTTG 
GT AT CGAATACCTAATAGAATAAGGGAGG ATGAG CATGAAAATTTTAATT 
CT 

SEQ ID NO. 5804
STRAIN H36B 

TTGATGGTGTTGTTATTCCAAAGGCTAGGAATTATTATG 

ATTTT AG C CTTTTT ATTGGTAAATAATAGTTATTT CAGACAGTTAATTG A 

AGAGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCGG-CTTGT 

TTGTT ATTATAT CTAAT ATAACAGGAATTG AAAT AAAAGGGG AT C GAAGT 

TTGGTCGAGCGCCCTTTTCTAACAACGATTTerCATTCT 

TAATACAAGGACTTTAGTTATTACAACGGC1AAGTTTG 

TGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTT CAT CG CTTTTTTCAA 

GG AAG CTTTTCAGGTTCTTT CTAT ATTGT CAGTT CAGTTCTAGT CGGCAT 

TGTTAGCGGAAAGATTGGTGATAAGCITAAGGAAAACCATCTCTACCCTT 

CAACAAG CCAAGTTATTTTAAT TAGTATTATTG C CGAAAGTATC CAGATG 

CTATTTGTTGGCATTTTTACAGGATGGGAACTT^ 

T C CAATGATGATTTTAAATAGTTTAGGTT CCAQ^CTTTT CCT 

TGAAAACTTATTTGTCAAATGAAAGTCAGTTACG CGCAGTT CAAACGAGA 

GATGTTCTTGAATTGACTCGACAGACTCTGCCCT^ 

GACAC CG CAATCTGCTAGGAGCGTTTG CG AAATT ATAAAG AGG CATACTA 

ACTTTGATGCTGTGGGATTAACAGATCGGTCAAACGTATTAGCT 

GGTGTTGGCCATGATCACCATATTGCAGGACAACCGGTCAAAACAGACT 

ATCTAAAAGTGTTATTTTTGATGGCGAACCAAGAATTGCGCAAGATAAA 

CGGCGATTTCTTGTCCAGATCACAACTGTCAGTTAAATTOT 

GTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACTT 

TGCAGCAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTTGGTT 

TAGCGCAAATATTTTCAGGACAACTGG CAATGGGG ATAACAGAGGAACAA. 

AATAAGTTAG CCAGTATGGCAGAGATAAAGG CTTTACAAG CACAAAT CAA 

CCCT CATTT CTT CTTTAATG CCATTAACACAATT AGTG CATTAAT CCGT A 

TTGATTCTGATAAAG CACGTTATG CACTGATG CAGTTAAGTACiTTTTTT 

AGAACAAGTTTGC^GGGTGGTCAGGATCGTGAGGTAA.CGCTTGAGCAAGA 

AAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCTG 

ATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAA^TGAAGTTA 

CCACCTTTTGGTTTACAGGTACTGGTAGAGAATG CAGTT CGACAT 

CAAAGAACGTAAGACGGACAAC CATATATTGGTT CAAATAAAG C CAGATG 

GTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGA^ 

ACTATCATTGATAAATTAGGTCAAGAAACAGTTGCAGAGAGTAAGGGTAC 

AGGTACTG CT CTAGTTAAT CTAAATAACAGG CTGAATTTATTAT ATGGTA 

GTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTT^ 

TATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAATTC 

T 

SEQ ID NO. 5805
STRAIN 18RS21

TTGATGGTGTTGTTATTCCAAAGG CTAGGAATTATTATG 

ATTTTAG C CTTTTTATTGGTAAAT AATAGTTATTTTAGACAGTTAATTG A 

AGAGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCGGCTTGT 

TTGTT ATTATAT CT AATATAACAGGAATTGAAATAAAAGGGGAT CGAAGT 

TTGGTCGAGCGCCCTTTTCTAACAACGATTTCT 

TAATACAAGGACTTTAGTTATTACAACGGCAAGTTTGGTTGGTGGACCTC 
TGGTTGGAT CAATTGTTGGTTTTATTGGAGGAGTTCATCG CTTTTTT CAA. 
GG AAG CTTTT CAGGTTCTTT CTATATTGT CAGTT CAGTT CTAGTCGG CAT 
TGTTAG CGG AAAGATTGGTG ATAAG CTTAAGG AAAAC CATCTCTAC CCTT 
CAACAAGCCAAGTTATTTTAATTAGTATTATTGCCGA 
CTATTTGTTGGCATTTTTACAGGATGGGAACrTGTCAAAATGAT^ 
TCCAATGATGATTTTAAATAGTTT AGGTT C CACACTTTT C CTTG CGATTT 
TGAAAACTTATTTGTCAAATGAAAGTCAGTTACGCGCAGTTC^ 
GATGTTCTTGAATTGACrCGACAGACTCTGCCCTACCTTAGACAAGG 
GACACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATACTA
ACTTTGATGCTGTGGGATTAACAGATCGGTCAAACGTATTAGCTCATATT 
GGTGTTGGC(^TGATCACCATATTGCAGGACAACCGGTCAAAACAGACTT 
AT CTAAAAGTGTTATTTTTG ATGG CGAAC CAAGaATT G CG CAAG ATAAAG 
CGGCG ATTTCTTGT C CAGAT CACAACTGT CAGTT AAATT CTG CT ATTGT A 
GTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTACTT 
TGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTTGGTT 
TAGCG CAAATATTTT CAGGACAACTGG CAATGGGGAT AACAGAGG AACAA 
AATAAGTTAGCCAGTATGGCAGAGATAAAGGCTTTACAAGCACAAATCAA 
C C CT CATTT CTT CTTTAATG CC ATT AACACAATTAGTGCATTAAT C CGT A 
TTGATTCTGATAAAG CACGTTATG CACTGATG CAGTTAAGT ACTTTTTTT 
AGAACAAGTTTGCAGGGTGGTCAGGATCGTGAGGTAACGCTTGAGCAAGA 
AAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCTG 
ATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAGTTA 
CCACCTTTTGGTTTACAGGTACTGGTAGAGAATG CAGTTCGACATGCTTT 
CAAAGAACGTAAGACGGACAAC CATAT ATTGGTTCAAAT AAAG C CAGATG 
GTCATTATTATTGTGTTTCrGTTAGTGACAATGGACAAGGAATCTCAGAT 
ACTAT CATTGATAAATTAG GTCAAG AAACAGTTG CAGAG AGTAAGGGTAC 
AGGTACTGCT CTAGTTAATCTAAAT AACAGG CTGAATTTATTATATGGTA 
GTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTTTGG 
TAT CGAATAC CTAAT AGAATAAGGGAG GATGAG CATG AAAATTTTAATT C 
T 

SEQ ID NO. 5806
STRAIN M732 

TTGATGGTGTTGTTATTCCAAAGGCTAGGAATTATTATGAT 

TTTAG CCTTTTT ATTGGTAAATAATAGTTATTT CAGACAGTTAATTGAAG 

AGCGGTCTAAACGTGAAACGGTAGTCCTTGTCATCATTTTCGGCTTGTTT 

GTTATT ATAT CTAATATAACAGGAATTGAAAT AAAAGGGG ATCGAAGTTT 

GGTCGAGCGCCCITTTCTAACAACGATTTCCCATTCT 

ATACAAGGACTTTAGTTATTACAACGGCAAGTTTGGTTGGTGGACCT 

GTTGGAT CAATTGTTGGTTTTATTGGAGGAGTT CAT CG CTTTTTT CAAGG 

AAGCTTTTCAGGTT CTTTCT ATATTGT CAGTT CAGTTCTAGTCGGCATTG 

TTAGCGGAAAGATTGGTGATAAGCTTAAGGAAAACCATCTCrACCCTTCA 

ACAAG CCAAGTTATTTTAATTAGT ATTATTG CCG A^GTAT C CAGATG CT 

ATTTGTTGG CATTTTTACAGGATGGGAACTTGT CAAAATGATTGT CATTC 

CAATGATGATTTTAAATAGTTTAGGTT CCAGACTTTTCCTTG CGATTTTG 

AAAACTTATTTGT CAAATG AAAGT CAGTT ACG CG CAGTT CAAACGAGAGA 

TGTT CTTGAATTGACTCGACAG ACTCTGC C CT AC CTT AG ACAAGGTTTGA 

CAC CG CAAT CTGCTAGGAG CGTTTG CGAAATTATAAAGAGG CATACTAAC 

TTTGATG CTGTGG<3ATT AACAGATCGGTCAAACGTATTAG CT CATATTGG 

TATTGG C CATGATCACCATATTG CAGGACAAC CGGTCAAAACAGAC^^ 

CTAAAAGTGTTATTTTTGATGG CGAAC CAAGAATTG CG CAAGATAAAGCG 

GCGAtTTCTTGTCCAGATCACAACTGTCAGTTAAATTCT 

T C CTCTAAAAAT AAATGATAAAACTGT GTGTG C CTTAAAAATGTACTTTG 

CAGGAGATAAGACAATGTCTGAGGTGGAGG AAAACCT AGT CCTTGGTTTA 

GCG CAAATATTTTCAGGACAACTGG CAATGGGGATAACAGAGGAACAAAA 

TAAGTTAGCCAGTATGG CAGAGATAAAGG CTTTACAAGCACAAAT CAAC C 

CTCATTTCTTCTTTAATGC CATTAACACAATTAGTG CATTAATCCGTATT 

GATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTA 

AACAAGTTTG CAAGGTGGTCAGGAT CGTG AGGTAACG CTTGAGCAAGAAA 

AATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCCCTGAT 

AAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAGTTACC 

GCCTTTTGGTTT ACAGGTACTGGTAGAGAATG CAGTT CGACATGCTTTCA 

AAGAACGTAAGACGGACAAC CATATATTGGTT CAAATAAAGC CAGATGGT 

CATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGA^ 

TATCATTGATAAATTAGGTCAAGAAACAGTTGCAGAGAGTAAGGGGACAG 

GTACTGCTCTAGTTAATCTAAATAACAGGCTGAATTTATTATATGGTAGT 

GT AAGTTGC CTT CATTTTT CGAGCGACAAG AATGGTACAAAAGTTTGGT A 

TCGrAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAATTCT 

SEQ ID NO. 5807 
STRAIN COH1 

TTGATGGTGTTGTTATT C CAAAGGCTAGGAATTAT 

TATGATTTTAGCCTTTTTATTGGTAAATAATAGTTATTTCAGACAGTTAA 

TTGAAGAGCGGT CTAAACGTGAAACGGTAGTC CTTGT CAT CATTTTCGG C 

TTGTTTGTTATTATATCTAATATAACAGGAATTGAAATAAAAGGGGATCG 

AAGTTTGGTCGAGCGCCCTTTTCTAACAACGATTTCCCATTCT 

TTGCTAATACAAGGACTTTAGTTATTACAACGGCM 

C CTCTGGTTGGATCAATTGTTGGTTTT ATTGGAGGAGTC 

TCAAGGAAG CTTTT CAGGTT CTTT CTATATTGT CAGTT CAGTTCT AGTCG 

GCATTGTTAGCGGAAAGATTGGTGATAAG CTTAAGGAAAACCATCTCTAC 

CCTTCAACAAGCCAAGTTATTTTAATTAGTATTATTGCCGAAAG 

GATG CTATTTGTTGG CATTTTTACAGG ATGGGAACTTGTCAAAATGATTG 

TCATT CGAATGATGATTTTAAATAGTTTAGGTT C GACACTTTTC CTTG CG 

ATTTTGAAAACTTATTTGT CAAATGAA^GT CAGTTACGCG CAGTT CAAAC 

GAGAGATGTTCETGAATTGACTCGACAGACT CTG C CCTAC CTTAGACAAG 

GTTTGACAC CGCAATCTGCTAGGAG CGTTTG CGAAATTATAAAGAGG CAT 

ACTAACTTTGATGCTGTGGGATTAACAGATCGGTCAAA 

TATTGGTGTTGG CCATGAT CAC CAT ATTG CAGGACAACCGGT CAAAACAG 

ACTTATCTAAAAGTGTT ATTTTTG ATGG CG AACCAAGAATTG CG CAAGAT 

AAAG CGG CGATTT CTTGT C CAGAT CACAACTGTCAGTTAAATTCTG CTAT 

TGTAGTTCCTCTAAAAATAAATGATAAAACTGTGTGTGCCTTAAAAATGT 
ACTTTGCAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGTCCTT
GGTTT AG CG CAAATATTTT CAGGACAACTGGCAAT GG GG AT AACAGAGG A 
ACAAAATAAGTTAG CCAGTATGG CAGAGATAAAG G CTTT ACAAG CACAAA 
TCAAC CCT CATTT CTTTCT/TTAATGC CATTAACACAATTAGTG CATTAAT C 
CGTATTGATTCTGATAAAGCACGTTATGCACrGATGCAGTTAAGTACTTT 
TTTTAGAACAAGTTTGCAAGGTGGTCAGGATCGTGAGGTAACGCTTGAGC 
AAGAAAAAT CACATGTGG ATG CTT ATATG AATGTTGAAAAATTACGTTTC 
CCTG ATAAAT AT CAGTTAT CTTAT G AT ATTAGTG CAC CAG AAAAAATGAA 
GTTAC CG GCTTTTGGTTTACAGGTACTGGT AGAG AATG CAGTT CGACATG 
CTTTCAAAGAACGTAAGACGGACAACCATATATTGGTTCAAATAAAGCCA 
GATGGTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAATCTC 
AG ATACTATCATTG ATAAATTAGGT CAAGAAACAGTTG CAGAGAGTAAGG 
GG ACAGGTACTG CT CTAGTTAAT CTAAATAACAGG CTGAATTTATTATAT 
GGTAGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGT 
TTGGT AT CG AATAC CTAATAGAATAAGGG AGG AT GAG CATG AAAATTTTA 
ATTCT 

SEQ ID NO. 5808
STRAIN M781 

TTGATGGTGTTGTT ATT CCAAAGG CTAGG AATTATTA 

TGATTTTAGCCITTTTATTGGTAAATAATAGTTATTTCA 

GAAGAGCGGT CTAAACGTGAAACGGTAGT C CTTGT CAT CATTTT CGG CTT 

GTTTGTTATTATAT CTAATATAACAGGAATTG A^vATAAAAGGGGATCGAA 

GTTTGGTCGAGCGCCCTTTTCTAACAACX3ATTTCCCAT^ 

GCTAATACAAG<^CTTTAGTTATTACAACGG CAAGTTTGGTTGGTGGAC C 

TCTGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTTCA 

AAGGAAG CTTTT CAGGTTCTTT CT AT ATTGT CAGTTCAGTT CTAGTCGGC 

ATTGTTAGCGGAAAGATTGGTGATAAG CTTAAGGAAAACCAT CTCTACC C 

TT CAACAAGCCAAGTTATTTTAATTAGTATTATTG CCGAAAGTAT CCAGA 

TGCTATTTGTTCGCATTTTTACAGGATG 

ATTCCAATGATGATTTTAAATAGTTTAGGTT^ 

TTTGAAAACTTATTTGT CAAATGAAAGTCAGTTACGCG CAG t T CAAACGA 
GAGATGl'1'ClTGAATTGACTCGACAGACrrCTGCCCTACCTTAGACAAGGT 
TTGACACCX3CAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATAC 
TAA.CTTTGATG CTGTGGGATTAACAGATCGGT CAAACGTATTAG CTCATA 
TTGGTGTTGGCCATGATCACCATATTGCAGGACAACCG^^ 
TTATCTAAAAGTGTT ATTTTTGATGG CGAACCAAG AATTG CG CAAGATAA 
AG CGG CGATTTCTTGTCCAGATCACAACTGT CAGTTAAATT CTGCTATTG 
TAGTTCCTCTAAAAATAAATGATAAAACTGTGTGTGCCTTAAAAATGTAC 
TTTG CAGGAGATAAGACAATGTCTGAGGTGGAGGAAAACCTAGT C CTTGG 
TTTAGCGCAAATATTTTCAGGACAACTGGCAATGC^ 

AAAAT AAGTTAGCCAGT ATGG CAGAGATAAAGGCTTTACAAG CACAAAT C 
AACCCTCATTTCTTCTTTAATGCCATTAACACA 

TATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTACTTTTT 
TTAGAACAAGTTTGCAAGGTGGTCAGGATCGTGAGGTAACGCTTGAGCAA. 
GAAAAATCACATGTGGATG CTTATATGAATGTTGAAAAATTACGTTT CC C 
TGATAAATATCAGTTAT CTTATGATATTAGTGCAC CAGAAAAAATGAAGT 
TACCGCCTTTTGGTTTACAGGTACTGGTAGAGAATG CAGTT C 
TTCAAAGAACGTAAGACGGACAAC CAT ATATTGGTTCAAATAAAGCCAG A 
TGGTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAA 
ATACTAT CATTGATAAATTAGGTCAAGAAA.CAGTTG CAGAGAGTAAGGGG 
ACAGGTACTGCTCTAGTTAATCTAAATAACAGGCTG/yVTTTATTATATGG 
TAGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTT^ 
GGTATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAAT 

TCT 

SEQ ID NO. 5809 
STRAIN CJB110

TTGATGGTGTTGTTATT C CAAAGG CTAGGAATTATTAT 
GATTTTAGCCITTTTATTGGTAAATAATAGTO 

AAGAG CGGT CTAAACGTGAAACGGTAGTACTTGT CAT CATTTTCGG CTTG 

TTTGTTATTATATCTAATATAACAGGAATTGAAATAAAAGGGGATCGAAG 

TTTGGTCGAGCGCCLTTTTCT'AACAACGATTTCCCATTCTGACTCACTTG 

CTAATACAAGGACTTTAGTTATTACAACGGCAAGTTTGGTTGGTGGACCT 

CTGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTTCATCGCTT^ 

AGGAAGCTTTTCAGGTTCTTTCTATATTGTCAGTTCAGTTCT 

TTGTTAG CGGAAAGATTGGTGATAAG CTTAAGGAAAACCATCTCTAC CCT 

TCAACAAGCCAAGTTATTTTAATTAGTATTATTGCCGAAAGTATCCAGAT 

GCTATTTGTTGGTATTTTTACAGGATGGGAACT^ 

TT CCAATGATGATTTTAAATAGTTT AGGTT CCACACTTTTCCT^ 

TTGAAAACITATTTGTCAAATGAAAGTCAGTTACGCX3CAGTTCAAACGAG 

AGATGTTCTTGAATTGACTCGACAGACrrcrrGCCCTACCT 

TGACACCGCAATCTG CTAGGAG CGTTTG CGAAATTATAAAGAGG CATACT 

AACIT'TGATGCTGTAGGATTAACAGATCGGTCAAACGTAT^ 

TGGTGTTGGCCATGATCACCATATTGCAGGACAACCAGTCAAAACAGACC 

TATCTAAAAGTGTT ATTTTTGATGG CGAAC CAAGAATTGCG CAAGAT AAA 

GCGGCGATTT CTTGT C CAGATCACAACTGTCAGTTAAATTCTG CT ATTGT 

AGTTCCTCTAAAAATAAATGATAAARCrrGTGGGTGCCTT'AAAAATGTACT 

TTGCAGGAGATAAGACAATGTCrTGAGGTGGAGGAAAACCTAGTCCTTGGT 

TTAGCGCAAATATTTTCAGGACAACTGGCAATGGG 

AAAT AAGTTAG C CAGTATGG CAGAGAT AAAGG CTTTACAAG CACAAAT CA 
AC C CT CATTTTTT CTTT AATG C CATTAACACAATTAGTG CATTAAT C CGT 
ATTGATT CTGATAAAG CACGTTATG CACTGATG CAGTTAAGT ACTTTTTT 
TAGAACAAGTTTGCAAGGTGGTCAGGATCGTGAGGTAACGCTTGAGCAAG
AAAAATCACATGTGG ATG CTTATATGAAT GTTGAAAAATT ACGTTT C CCT 
GATAAATAT<^GTTATCTTATGATATTAGTGCACCAGAAAAAATGAAGTT 
ACCX3CCTTTTGGTTTACAGGTACTGGTAGAGAATGCAGTTAGACATGCTT 
T CAAAGAACGTAAG ACGG ACAACCATATATTGGTT CAAAT AAAG C CAGAT 
GGT CATT ATT ATTGTGTTT CTGTT AGTGACAATGG ACAAGG AAT CTCAG A 
TACTATCATTGATAAATTAGGT CAAGAAACAGTT G CAGAG AGTAAGG GTA 
CAGGTACTG CT CTAGTT AAT CT AAATAAG?\GG CTGAATTT ATTAT ATGGT 
AGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTTTG 
GT AT CGAATACCTAATAGAATAAGGGAGGATGAG CATGAAAATTTTAATT 
CT 

SEQ ID NO. 5810
STRAIN 1169NT 

TTGAT GGTGTTGTTATT C CAAAGG CTAGGAATTATT 

ATGATTTTAG CCTTTTT ATTGGTAAATAAT AGTT ATTT CAGACAGTT AAT 
TGAAG AG CGGTCTAAACGTG AAACG GTAGTACTT GT CAT CATTTT CGG CT 
TGTTTGTTATTATATCTAATATAACAGGAATTGAAATAAAAGGGGATCGA 
AGTTTGGTCGAGCGCCCTTTTCTAACAACGATTTCT 

TGCTAATACAAGGACTTTAGTTATTACAACGGCAAGTTTGGTTGGTGGAC 

CTCTGGTTGGATCAATTGTTGGTTTTATTGGAGGAGTTCATCGCTTTTTT 

CAAGGAAGCTTTTCAGGTTCTTTCTATATTGTCAGTTCA 

CATTGTG AG CGGAAAGATTGGTGATAAG CTTAAGGAAAAC CAT CT CTACC 

CITGAACAAG CCAAGTTATTTTAATTO CG AAAGT AT CCAG 

ATGCTATTTGTTGGCATTTTTA(^GGATGGGA 

CATT C GAATG ATGATTTTAAATAGTTTAGGTT CCACAC^ 

TTTTGAAAACTTATTTGTCAAATGAAAGTCAGTTAC^ 

AGAGATGTT CTTGAATTGACT CGACAGACT CTGC C CTAC CTTAGACAAGG 

TTTGACACCGCAATCTGCTAGGAGCGTTTGCGAAATTATAAAGAGGCATA 

CTAATTTTGATGCTGTGGGATTAACAGATCGGTCAAAC^TATT 

ATTGGTGTTGGCCATGATCACCATATTGCAGGACIAACCAG 

C CT ATCTAAAAGTGTTATTTTTGATGG CGAACCAAGAATTGCGCAAG ATA 

AAGC GGCGATTT CTTGTCCAGATCACAACTGT CAGTT A£u\TT CTG CTATT 

GTAGTTCCTCTAAAAATAAATGATAAAACTGTGGGTG CCTTAAAAATGT A 

CTTTG CAGGAGAT AAGACAATGTCTGAGGTGGAGG AAAAC CTAGT CCTTG 

GTTTAGCGCAAATATTTTCAGG ACAACTGG CAATGGGGATAACAGAGGAA 

CAAAATAAGTTAGC CAGTATGG CAGAG AT AAAGG CTTTACAAG CACAAAT 

CAACCCTCATTT CTT CTTTAATGCCATTAACA.CAATT AGTGCATTAAT C C 

GTATTGATTCTGATAAAGCACGTTATGCACTGATGCAGTTAAGTACTT^ 

TTTAGAA.CAAGTTTG CAAGGTGGT CAGGATCGTG AGGTAACG CTTGAGC^ 

AGAAAAATCACATGTGGATGCTTATATGAATGTTGAAAAATTACGTTTCC 

CTGATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAG 

TTAC CG C CTTTTGGTTTACAGGTACTGGTAGAGAATG CAGTT CGACATG C 

TTTTAAAGAACGTAAGACGGACAAC CATATATTGGTTCAAATAAAGCCAG 

ATGGTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAATCTCA 

GATACTATCATT GATAAATTAGGTCAAGAAACAGTTG CAGAGAGTAAGGG 

TACAGGTACTGCTCTAGTTAATCTAAATAACAGGCTGAATTTATTATATG 

GTAGTGTAAGTTG CCITCATTTTT CGAGCGACAAGAATGGTACAAA 

TGGTATCGAATACCTAATAGAATAAGGGAGGATGAGCATGAAAATTTTAA 

TTCT 

SEQ ID NO. 5811
STRAIN JM9130013

TTGATGGTGTTGTTATT CCAAAGG CTAGGAATTATT 

ATGATTTTAG CCTTTTT ATTGGTAAATAATAGTT ACT 

TGAAGAG CG GT CTAAACGTGAAACGGTAGT CCTTGT CAT CATTTT CGG CT 

TGTTTGTTATTATAT CTAATATAACAGGAATTGAAATAAAAGGGGAT CG A 

AGTCTGGTCGAGCGCCCTTCTCTAACAACGATCTCTCACT 

TGCTAATACAAGGACITTAGTTATTACAACGGCAAGTT^ 

CTCTGGTTGGAT CAATTGTTGGTTTTATTGGAGGAGTT CAT CGCTTiTTT 

CAAGGAAG CTTTTCAGGTT CTTT CTATATTGT CAGTT CAGTT CTAGT CGG 

CATTGTTAGCGGAAAGATTGGTGATAAGCrrTAAGGAAAACCATCT CTACC 

(TTTCAACAAGCCAAGTTATTTTAATTAGTATTATTGCCGAAAGTAT 

ATGCTATTTGTTGGCATTTTTACAGGATGGGAACTTGTCAAAATGATTG 

CATTCCAATGATGATTTTAAATAGTTTAGGTTCCACACTTTTCCTTGCGA 

TTTTGAAAACTTATTTGTCAAATG AAAGT CAGTTACG CG CAGTTCAAACG 

AGAGATGTT CITGAATTGACTCGACAGACT CTGC C CTAC CTTAGACAAGG 

TTTGACACCG CAAT CTG CT AGGAGCGTTTG CGAAATTATAAAGAGGCATA 

CTAACTTTG ATG CTGTGGG ATTAACAG AT CGGTCAA^CX3T ATT CT CAT 

ATTGGTGTTGGC CATGATCACCAT ATTG CAGGACAAC CGGTCAAAACAGA 

CTTATCTAAAAGTGTTATTTTTGATGGCX^AACCAAGAAT^ 

AAGC GGCGATTT CTTGTCCAG AT CACAACT GT CAGTTAAATT CTG CTATT 

GTAGTTCCTCTAAAAATAAATGATAAAACTGTGGGTGCCTTAAAAATGTA 

CTTTGCAGGAGAT AAGACAATGTCTGAGGTGGAGG AAAACCTA 

GTTTAGCGCAAATATTTTCAGGACAACTGG CAATGGGGATAACAGAGGAA 

CAAAATAAGTTAGC CAGTATGG CAGAGAT AAAGG CTTTACAAGCACAAAT 

CAACCCTCATTT CTE^CTTTAATGCCATTA^ 

GTATTGATT CTG ATAAAGCACGTTATG CACTGAT G CAGTTAAGTACTTTT 
TTT AGAACAAGTTTG CAGGGTGGT CAGGATCGTGAGGTAACG CTTGAGCA 
agAAAAAT CACATGTGG ATG CTTATATGAATGTTGAAAAATTACGTTT CC 
CTGATAAATATCAGTTATCTTATGATATTAGTGCACCAGAAAAAATGAAG 
TTACCACCTTTTGGTTTACAGGTACrGGTAG 
TTTCAAAGAACX5TAAGACGGACAACCATATATTGGCT 
Table 58: Comparative Sequences relating to SAG0182



ATGGTCATTATTATTGTGTTTCTGTTAGTGACAATGGACAAGGAATCTCA 
GATACTAT CATTGATAAATT AGGT CAAGAAACAGTTG CAG AG AGTAAGGG 
TACAGGTACTG CT CTAGTT AAT CT AAATAACAGG CTGAATTTATTAT ATG 
GTAGTGTAAGTTGCCTTCATTTTTCGAGCGACAAGAATGGTACAAAAGTT 
TGGTATCGAATAC CTAATAG AATAAGGGAGGATGAG CATG AAAATTTTAA 
TTCT 

MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa442667.2{*}  January 13, 2003 06:34



                            1                                                   50
msa442667.2{248_18RS21}     TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_2603}       TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_A909}       TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_H36B}       TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_JM9130013}  TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_COH1}       TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_M781}       TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_M732}       TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_090}        TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_CJB110}     TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
msa442667.2{248_1169NT}     TTGATGGTGT TGTTATTCCA AAGGCTAGGA ATTATTATGA TTTTAGCCTT
Consensus                   ********** ********** ********** ********** **********



                            51                                                 100
msa442667.2{248_18RS21}     TTTATTGGTA AATAATAGTT ATTTtAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_2603}       TTTATTGGTA AATAATAGTT ATTTtAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_A909}       TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_H36B}       TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_JM9130013}  TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_COH1}       TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_M781}       TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_M732}       TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_090}        TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_CJB110}     TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
msa442667.2{248_1169NT}     TTTATTGGTA AATAATAGTT ATTTcAGACA GTTAATTGAA GAGCGGTCTA
Consensus                   ********** ********** ****_***** ********** **********



                            101                                                150
msa442667.2{248_18RS21}     AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_2603}       AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_A909}       AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_H36B}       AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_JM9130013}  AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_COH1}       AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_M781}       AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_M732}       AACGTGAAAC GGTAGTcCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_090}        AACGTGAAAC GGTAGTaCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_CJB110}     AACGTGAAAC GGTAGTaCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
msa442667.2{248_1169NT}     AACGTGAAAC GGTAGTaCTT GTCATCATTT TCGGCTTGTT TGTTATTATA
Consensus                   ********** ******_*** ********** ********** **********



                            151                                                200
msa442667.2{248_18RS21}     TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_2603}       TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_A909}       TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_H36B}       TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_JM9130013}  TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_COH1}       TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_M781}       TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_M732}       TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_090}        TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_CJB110}     TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
msa442667.2{248_1169NT}     TCTAATATAA CAGGAATTGA AATAAAAGGG GATCGAAGTT TGGTCGAGCG
Consensus                   ********** ********** ********** ********** **********



                            201                                                250
msa442667.2{248_18RS21}     CCCTTTTCTA ACAACGATTT CtCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_2603}       CCCTTTTCTA ACAACGATTT CtCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_A909}       CCCTTTTCTA ACAACGATTT CtCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_H36B}       CCCTTTTCTA ACAACGATTT CtCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_JM9130013}  CCCTTTTCTA ACAACGATTT CtCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_COH1}       CCCTTTTCTA ACAACGATTT CcCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_M781}       CCCTTTTCTA ACAACGATTT CcCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_M732}       CCCTTTTCTA ACAACGATTT CcCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_090}        CCCTTTTCTA ACAACGATTT CcCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_CJB110}     CCCTTTTCTA ACAACGATTT CcCATTCTGA CTCACTTGCT AATACAAGGA
msa442667.2{248_1169NT}     CCCTTTTCTA ACAACGATTT CtCATTCTGA CTCACTTGCT AATACAAGGA
Consensus                   ********** ********** *_******** ********** **********



                            251                                                300
msa442667.2{248_18RS21}     CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_2603}       CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_A909}       CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_H36B}       CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_JM9130013}  CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_COH1}       CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_M781}       CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_M732}       CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_090}        CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_CJB110}     CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
msa442667.2{248_1169NT}     CTTTAGTTAT TACAACGGCA AGTTTGGTTG GTGGACCTCT GGTTGGATCA
Consensus                   ********** ********** ********** ********** **********



                            301                                                350
msa442667.2{248_18RS21}     ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_2603}       ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_A909}       ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_H36B}       ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_JM9130013}  ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_COH1}       ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_M781}       ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_M732}       ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_090}        ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_CJB110}     ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
msa442667.2{248_1169NT}     ATTGTTGGTT TTATTGGAGG AGTTCATCGC TTTTTTCAAG GAAGCTTTTC
Consensus                   ********** ********** ********** ********** **********



                            351                                                400
msa442667.2{248_18RS21}     AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_2603}       AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_A909}       AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_H36B}       AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_JM9130013}  AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_COH1}       AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_M781}       AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_M732}       AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_090}        AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_CJB110}     AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTtAGCGGAA
msa442667.2{248_1169NT}     AGGTTCTTTC TATATTGTCA GTTCAGTTCT AGTCGGCATT GTgAGCGGAA
Consensus                   ********** ********** ********** ********** **_*******



                            401                                                450
msa442667.2{248_18RS21}     AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_2603}       AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_A909}       AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_H36B}       AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_JM9130013}  AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_COH1}       AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_M781}       AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_M732}       AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_090}        AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_CJB110}     AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
msa442667.2{248_1169NT}     AGATTGGTGA TAAGCTTAAG GAAAACCATC TCTACCCTTC AACAAGCCAA
Consensus                   ********** ********** ********** ********** **********



                            451                                                500
msa442667.2{248_18RS21}     GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_2603}       GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_A909}       GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_H36B}       GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_JM9130013}  GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_COH1}       GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_M781}       GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_M732}       GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_090}        GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_CJB110}     GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
msa442667.2{248_1169NT}     GTTATTTTAA TTAGTATTAT TGCCGAAAGT ATCCAGATGC TATTTGTTGG
Consensus                   ********** ********** ********** ********** **********



                            501                                                550
msa442667.2{248_18RS21}     cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_2603}       cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_A909}       cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_H36B}       cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_JM9130013}  cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_COH1}       cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_M781}       cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_M732}       cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_090}        tATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_CJB110}     tATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
msa442667.2{248_1169NT}     cATTTTTACA GGATGGGAAC TTGTCAAAAT GATTGTCATT CCAATGATGA
Consensus                   _********* ********** ********** ********** **********



                            551                                                600
msa442667.2{248_18RS21}     TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_2603}       TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_A909}       TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_H36B}       TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_JM9130013}  TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_COH1}       TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_M781}       TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_M732}       TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_090}        TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_CJB110}     TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
msa442667.2{248_1169NT}     TTTTAAATAG TTTAGGTTCC ACACTTTTCC TTGCGATTTT GAAAACTTAT
Consensus                   ********** ********** ********** ********** **********

                            601                                                650
msa442667.2{248_18RS21}     TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_2603}       TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_A909}       TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_H36B}       TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_JM9130013}  TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_COH1}       TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_M781}       TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_M732}       TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_090}        TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_CJB110}     TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
msa442667.2{248_1169NT}     TTGTCAAATG AAAGTCAGTT ACGCGCAGTT CAAACGAGAG ATGTTCTTGA
Consensus                   ********** ********** ********** ********** **********

                            651                                                700
msa442667.2{248_18RS21}     ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_2603}       ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_A909}       ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_H36B}       ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_JM9130013}  ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_COH1}       ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_M781}       ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_M732}       ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_090}        ATTGACTCGA CAGACTCTGC CCTACCTcAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_CJB110}     ATTGACTCGA CAGACTCTGC CCTACCTcAG ACAAGGTTTG ACACCGCAAT
msa442667.2{248_1169NT}     ATTGACTCGA CAGACTCTGC CCTACCTtAG ACAAGGTTTG ACACCGCAAT
Consensus                   ********** ********** *******_** ********** **********

                            701                                                750
msa442667.2{248_18RS21}     CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_2603}       CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_A909}       CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_H36B}       CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_JM9130013}  CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_COH1}       CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_M781}       CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_M732}       CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_090}        CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_CJB110}     CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA cTTTGATGCT
msa442667.2{248_1169NT}     CTGCTAGGAG CGTTTGCGAA ATTATAAAGA GGCATACTAA tTTTGATGCT
Consensus                   ********** ********** ********** ********** _*********

                            751                                                800
msa442667.2{248_18RS21}     GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_2603}       GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_A909}       GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_H36B}       GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_JM9130013}  GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_COH1}       GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_M781}       GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_M732}       GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTaTTGGCCA
msa442667.2{248_090}        GTaGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_CJB110}     GTaGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
msa442667.2{248_1169NT}     GTgGGATTAA CAGATCGGTC AAACGTATTA GCTCATATTG GTgTTGGCCA
Consensus                   **_******* ********** ********** ********** **_*******

                            801                                                850
msa442667.2{248_18RS21}     TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_2603}       TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_A909}       TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_H36B}       TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_JM9130013}  TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_COH1}       TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_M781}       TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_M732}       TGATCACCAT ATTGCAGGAC AACCgGTCAA AACAGACtTA TCTAAAAGTG
msa442667.2{248_090}        TGATCACCAT ATTGCAGGAC AACCaGTCAA AACAGACcTA TCTAAAAGTG
msa442667.2{248_CJB110}     TGATCACCAT ATTGCAGGAC AACCaGTCAA AACAGACcTA TCTAAAAGTG
msa442667.2{248_1169NT}     TGATCACCAT ATTGCAGGAC AACCaGTCAA AACAGACcTA TCTAAAAGTG
Consensus                   ********** ********** ****_***** *******_** **********



WO 2004/018646

PCT/US2003/026827

                            851                                                900
msa442667.2{248_18RS21}     TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_2603}       TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_A909}       TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_H36B}       TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_JM9130013}  TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_COH1}       TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_M781}       TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_M732}       TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_090}        TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_CJB110}     TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
msa442667.2{248_1169NT}     TTATTTTTGA TGGCGAACCA AGAATTGCGC AAGATAAAGC GGCGATTTCT
Consensus                   ********** ********** ********** ********** **********

                            901                                                950
msa442667.2{248_18RS21}     TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_2603}       TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_A909}       TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_H36B}       TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_JM9130013}  TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_COH1}       TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_M781}       TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_M732}       TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_090}        TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_CJB110}     TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
msa442667.2{248_1169NT}     TGTCCAGATC ACAACTGTCA GTTAAATTCT GCTATTGTAG TTCCTCTAAA
Consensus                   ********** ********** ********** ********** **********

                            951                                               1000
msa442667.2{248_18RS21}     AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_2603}       AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_A909}       AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_H36B}       AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_JM9130013}  AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_COH1}       AATAAATGAT AAAACTGTGt GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_M781}       AATAAATGAT AAAACTGTGt GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_M732}       AATAAATGAT AAAACTGTGt GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_090}        AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_CJB110}     AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
msa442667.2{248_1169NT}     AATAAATGAT AAAACTGTGg GTGCCTTAAA AATGTACTTT GCAGGAGATA
Consensus                   ********** *********_ ********** ********** **********

                            1001                                              1050
msa442667.2{248_18RS21}     AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_2603}       AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_A909}       AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_H36B}       AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_JM9130013}  AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_COH1}       AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_M781}       AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_M732}       AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_090}        AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_CJB110}     AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
msa442667.2{248_1169NT}     AGACAATGTC TGAGGTGGAG GAAAACCTAG TCCTTGGTTT AGCGCAAATA
Consensus                   ********** ********** ********** ********** **********

                            1051                                              1100
msa442667.2{248_18RS21}     TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_2603}       TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_A909}       TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_H36B}       TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_JM9130013}  TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_COH1}       TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_M781}       TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_M732}       TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_090}        TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_CJB110}     TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
msa442667.2{248_1169NT}     TTTTCAGGAC AACTGGCAAT GGGGATAACA GAGGAACAAA ATAAGTTAGC
Consensus                   ********** ********** ********** ********** **********

                            1101                                              1150
msa442667.2{248_18RS21}     CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_2603}       CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_A909}       CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_H36B}       CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_JM9130013}  CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_COH1}       CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_M781}       CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_M732}       CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_090}        CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
msa442667.2{248_CJB110}     CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTtT
msa442667.2{248_1169NT}     CAGTATGGCA GAGATAAAGG CTTTACAAGC ACAAATCAAC CCTCATTTcT
Consensus                   ********** ********** ********** ********** ********_*






                            1151                                              1200
msa442667.2{248_18RS21}     TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_2603}       TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_A909}       TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_H36B}       TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_JM9130013}  TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_COH1}       TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_M781}       TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_M732}       TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_090}        TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_CJB110}     TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
msa442667.2{248_1169NT}     TCTTTAATGC CATTAACACA ATTAGTGCAT TAATCCGTAT TGATTCTGAT
Consensus                   ********** ********** ********** ********** **********



                            1201                                              1250
msa442667.2{248_18RS21}     AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_2603}       AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_A909}       AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_H36B}       AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_JM9130013}  AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_COH1}       AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_M781}       AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_M732}       AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_090}        AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_CJB110}     AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
msa442667.2{248_1169NT}     AAAGCACGTT ATGCACTGAT GCAGTTAAGT ACTTTTTTTA GAACAAGTTT
Consensus                   ********** ********** ********** ********** **********



                            1251                                              1300
msa442667.2{248_18RS21}     GCAgGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_2603}       GCAgGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_A909}       GCAgGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_H36B}       GCAgGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_JM9130013}  GCAgGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_COH1}       GCAaGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_M781}       GCAaGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_M732}       GCAaGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_090}        GCAaGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_CJB110}     GCAaGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
msa442667.2{248_1169NT}     GCAaGGTGGT CAGGATCGTG AGGTAACGCT TGAGCAAGAA AAATCACATG
Consensus                   ***_****** ********** ********** ********** **********



                            1301                                              1350
msa442667.2{248_18RS21}     TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_2603}       TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_A909}       TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_H36B}       TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_JM9130013}  TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_COH1}       TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_M781}       TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_M732}       TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_090}        TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_CJB110}     TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
msa442667.2{248_1169NT}     TGGATGCTTA TATGAATGTT GAAAAATTAC GTTTCCCTGA TAAATATCAG
Consensus                   ********** ********** ********** ********** **********



                            1351                                              1400
msa442667.2{248_18RS21}     TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CaCCTTTTGG
msa442667.2{248_2603}       TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CaCCTTTTGG
msa442667.2{248_A909}       TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CaCCTTTTGG
msa442667.2{248_H36B}       TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CaCCTTTTGG
msa442667.2{248_JM9130013}  TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CaCCTTTTGG
msa442667.2{248_COH1}       TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CgCCTTTTGG
msa442667.2{248_M781}       TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CgCCTTTTGG
msa442667.2{248_M732}       TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CgCCTTTTGG
msa442667.2{248_090}        TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CgCCTTTTGG
msa442667.2{248_CJB110}     TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CgCCTTTTGG
msa442667.2{248_1169NT}     TTATCTTATG ATATTAGTGC ACCAGAAAAA ATGAAGTTAC CgCCTTTTGG
Consensus                   ********** ********** ********** ********** *_********



msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}

Consensus



1401 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 

TTTACAGGTA 
********** 



CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 

CTGGTAGAGA 
********** 



ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTcG 

ATGCAGTTaG 

ATGCAGTTaG 

ATGCAGTTcG 
********_* 



ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTc 

ACATGCTTTt 
*********_



1450 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 

AAAGAACGTA 
********** 
msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}

Consensus 



msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}

Consensus



msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}

Consensus



msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}

Consensus



msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}

Consensus 



msa442667.2{248_18RS21}
msa442667.2{248_2603}
msa442667.2{248_A909}
msa442667.2{248_H36B}
msa442667.2{248_JM9130013}
msa442667.2{248_COH1}
msa442667.2{248_M781}
msa442667.2{248_M732}
msa442667.2{248_090}
msa442667.2{248_CJB110}
msa442667.2{248_1169NT}



1451 

AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 
AGACGGACAA 



CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 
CCATATATTG 



GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 
GTTCAAATAA 



AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 
AGCCAGATGG 



********** ********** ********** ********** 



TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 

TTAGTGACAA 
********** 



1501 

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG

TGTGTTTCTG 

TGTGTTTCTG 

TGTGTTTCTG 
********** 

1551 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 

TAAATTAGGT 
********** ********** 



TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 

TGGACAAGGA 
********** 



CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 
CAAGAAACAG 



TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 

TTGCAGAGAG 
********** 



ATCTCAGATA 

ATCTCAGATA

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 

ATCTCAGATA 
********** 



TAAGGGtACA

TAAGGGtACA

TAAGGGtACA

TAAGGGtACA 

TAAGGGtACA 

TAAGGGgACA 

TAAGGGgACA 

TAAGGGgACA 

TAAGGGtACA 

TAAGGGtACA 

TAAGGGtACA 
******_*** 



1601 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 

TAGTTAATCT 
********** 

1651 

CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT 
CTTCATTTTT
********** 

1701 

TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 
TAATAGAATA 



AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 

AAATAACAGG 
********** 



CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 

CTGAATTTAT 
********** 



GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
GAATGGTACA 
********** ********** 



CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 
CGAGCGACAA 



TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 

TATATGGTAG 
********** 



AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 

AAAGTTTGGT 
********** 



1500 
TCATTATTAT
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
TCATTATTAT 
********** 

1550 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 

CTATCATTGA 
********** 

1600 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
GGTACTGCTC 
********** 

1650 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 

TGTAAGTTGC 
********** 

1700 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 

ATCGAATACC 
********** 



AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 
AGGGAGGATG 



AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 
AGCATGAAAA 



1740 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
TTTTAATTCT 
Table 58: Comparative Sequences relating to SAG0182



Consensus ********** ********** ********** ********** 



SEQ ID NO. 5811
STRAIN 2603 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5812
STRAIN 090 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5813
STRAIN A909 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5814
STRAIN H36B frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5815
STRAIN 18RS21 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5816
STRAIN M732 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGIGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVCALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5817
STRAIN COH1 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVCALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5818
STRAIN M781 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVCALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5819 
STRAIN CJB110 frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5820 
STRAIN 1169NT frame: 1

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

SEQ ID NO. 5821 
STRAIN JM9130013 frame: 1 

LMVLLFQRLGIIMILAFLLVNNSYFRQLIEERSKRETVVLVIIFGLFVIISNITGIEIKG
DRSLVERPFLTTISHSDSLANTRTLVITTASLVGGPLVGSIVGFIGGVHRFFQGSFSGSF
YIVSSVLVGIVSGKIGDKLKENHLYPSTSQVILISIIAESIQMLFVGIFTGWELVKMIVI
PMMILNSLGSTLFLAILKTYLSNESQLRAVQTRDVLELTRQTLPYLRQGLTPQSARSVCE
IIKRHTNFDAVGLTDRSNVLAHIGVGHDHHIAGQPVKTDLSKSVIFDGEPRIAQDKAAIS
CPDHNCQLNSAIVVPLKINDKTVGALKMYFAGDKTMSEVEENLVLGLAQIFSGQLAMGIT
EEQNKLASMAEIKALQAQINPHFFFNAINTISALIRIDSDKARYALMQLSTFFRTSLQGG
QDREVTLEQEKSHVDAYMNVEKLRFPDKYQLSYDISAPEKMKLPPFGLQVLVENAVRHAF
KERKTDNHILVQIKPDGHYYCVSVSDNGQGISDTIIDKLGQETVAESKGTGTALVNLNNR
LNLLYGSVSCLHFSSDKNGTKVWYRIPNRIREDEHENFNS

PRETTY of: /biotmp/msa442834.2{*} January 13, 2003 06:47



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus 



LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 

LMVLLFQRLG 
********** 



IIMILAFLLV 

IIMILAFLLV

IIMILAFLLV

IIMILAFLLV 

IIMILAFLLV 

IIMILAFLLV 

IIMILAFLLV 

IIMILAFLLV 

IIMILAFLLV 

IIMILAFLLV 

IIMILAFLLV 
********** 



NNSYFRQLIE

NNSYFRQLIE

NNSYFRQLIE

NNSYFRQLIE 

NNSYFRQLIE 

NNSYFRQLIE 

NNSYFRQLIE 

NNSYFRQLIE 

NNSYFRQLIE 

NNSYFRQLIE 

NNSYFRQLIE 
********** 



ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
ERSKRETVVL
********** 



50 

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII

VIIFGLFVII
********** 



51 100 
msa442834.2{248_090} SNITGIEIKG DRSLVERPFL TTISHSDSLA NTRTLVITTA SLVGGPLVGS
msa442834.2{248_1169NT} SNITGIEIKG DRSLVERPFL TTISHSDSLA NTRTLVITTA SLVGGPLVGS
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}
msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 
SNITGIEIKG 



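msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}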



DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 
DRSLVERPFL 



TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 
TTISHSDSLA 



********** ********** ********** 



NTRTLVITTA
NTRTLVITTA
NTRTLVITTA
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
NTRTLVITTA 
********** 



101 

IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
IVGFIGGVHR 
**********

151 

VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 
VILISIIAES 



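msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus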



FFQGSFSGSF

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 

FFQGSFSGSF 
********** 



YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
YIVSSVLVGI 
********** 



VSGKIGDKLK

VSGKIGDKLK

VSGKIGDKLK

VSGKIGDKLK 

VSGKIGDKLK 

VSGKIGDKLK 

VSGKIGDKLK 

VSGKIGDKLK 

VSGKIGDKLK 

VSGKIGDKLK 

VSGKIGDKLK 
********** ********** 



SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
SLVGGPLVGS 
********** 

150 

ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 
ENHLYPSTSQ 



IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 
IQMLFVGIFT 



GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 
GWELVKMIVI 



********** ********** ********** 



PMMILNSLGS
PMMILNSLGS
PMMILNSLGS
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
PMMILNSLGS 
********** 



201 

LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 
LSNESQLRAV 



QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 
QTRDVLELTR 



QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 
QTLPYLRQGL 



TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 
TPQSARSVCE 



********** ********** ********** ********** 



200 

TLFLAILKTY 

TLFLAILKTY

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 

TLFLAILKTY 
********** 

250 

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA

IIKRHTNFDA
********** 



251 

VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 
VGLTDRSNVL 



AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGvGHDHH
AHIGiGHDHH



IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 
IAGQPVKTDL 



********** ****-***** ********** 



SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
SKSVIFDGEP 
********** 



301 

CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 
CPDHNCQLNS 



AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND
AIVVPLKIND



********** ********** 



KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVgALKMYF 
KTVcALKMYF 
KTVcALKMYF 
KTVcALKMYF 
***-****** 



AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 
AGDKTMSEVE 



300 

RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 
RIAQDKAAIS 

RIAQDKAAIS 
********** 

350 

ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 
ENLVLGLAQI 



********** ********** 



msa442834.2{248_090} 



351 

FSGQLAMGIT 



EEQNKLASMA EIKALQAQIN PHFFFNAINT 



400 

ISALIRIDSD 



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 
FSGQLAMGIT 



EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 
EEQNKLASMA 



EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 
EIKALQAQIN 



PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 
PHFFFNAINT 



********** ********** ********** ********** 



ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 

ISALIRIDSD 
********** 

450 

EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 
EKLRFPDKYQ 

********** ********** ********** ********** ********** 



401 

KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 
KARYALMQLS 



TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 
TFFRTSLQGG 



QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 
QDREVTLEQE 



KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 
KSHVDAYMNV 



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



451 

LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 
LSYDISAPEK 



MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 
MKLPPFGLQV 



**********.. ********** 



LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 

LVENAVRHAF 
********** 



KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 

KERKTDNHIL 
********** 



500 

VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
VQIKPDGHYY
********** 



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



msa442834.2{248_090}
msa442834.2{248_1169NT}
msa442834.2{248_18RS21}
msa442834.2{248_2603}
msa442834.2{248_A909}
msa442834.2{248_CJB110}
msa442834.2{248_H36B}
msa442834.2{248_JM9130013}
msa442834.2{248_COH1}
msa442834.2{248_M781}
msa442834.2{248_M732}

Consensus



501 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 

CVSVSDNGQG 
********** 

551 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 

LHFSSDKNGT 
********** 



ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 

ISDTIIDKLG 
********** 



KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 

KVWYRIPNRI 
**********



QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 
QETVAESKGT 



GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 
GTALVNLNNR 



550 

LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 
LNLLYGSVSC 



********** ********** ********** 
580 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 

REDEHENFNS 
********** 
SEQ ID NO. 5901
STRAIN 2603

ATGAATAAAAGAAG AAAATT ATCAAAATTG AATGTAAAAAAACAT CATTT AG CTTATGGA 

G CTAT CACTTTAGTAG C C CT TTTTT CATGTATTTTGG CTGT AATGGTCATCTTTAAAAGT 

TCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAAAAATCA 

AAAATGACTAAGGCGACATCrAAATCAAAAGTAGAAGATGTAAAAG^GGCTCCAAAACCr 

T CT CAGG CAT CTAATGAAG C CC CAAAAT CAAGTT CT CAAT CTACAGAAGCT AATT CTCAG 

CAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAAC^ 

CCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGA 

C^CACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATAcTGCAGGGGCTATTGGCTCA 
GCAGCTGGAGCAGAAATGGCTGCTGCAAcAGGA 

ATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCITTCAGGACTT 
TTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCA^ 
ATTAAAGCTTATCGTG CTCAAGGTTTATCIAGCITGGGGTTACTAG 

SEQ ID NO. 5902 
STRAIN JM9130013 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAA 

AGCAGATAAAGTTCGCGTAGCCAAAAAATCAflAAATGAATAAGGCAACAT 
CTAAATCAAAAGTAGAAGGTGTAAAACAGGCTCCAAAACCAAGTTCTCAA 
T CTACAGAAG CTAATT CT CAG CAACAAGTTACTGCGAGTG AAGAGGCAG C 
TGTAGAACAAG CAGTTGT AACAGAAAATAC CCCTG CTACCAGT CAAGCAC 
AACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAG 
C CGAGTGG CCAAGTATTG AG CAATGGAAATACTG CAGGGGTTATTGGCTC 
AGCAGCAGCAGCACAAATGGCTGCrrGCAACGGGAGTTCCTCAGT 
GGGAACAT ATTATTG C CCGTGAATCAAATGGTAAT CCTAACGTTGCTAAT 
GCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCC^ 

AGCTACAGTT CAGGAT CAAGTTAAT t CAG CT ATTAAAG CTTAT CGTG CTC 
AAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO. 5903 

STRAIN 1169NT reverse complement

AAAAGTTCACAAGTTACTACTGAAT CTTTGT CAAAAGCAGATAAAGTTCG CGTAGC C 
AAAAAATCAAAAATGACTAAGGCGACAT CTAAATCAAAAGTAGAAGATGTAAAACAGG CT 
C CAAAACCTT CTCAGG CATCT AA.TGAAGTCC CAAAATCAAGTT CT CAAT CTACAGAAGCT 
AATTCTCAGCAACAAGTT ACT G CGAGTGAAGAGGCGGCTGTAGAACAAG CAGTTGTAACA 
GAAAATACCC CTG CTAC CAGT CAGG CACAACAAACTTATGCTGTTACTGAG ACAACTTAC 
AAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAATACTGCAG 
GTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCT 
GAACAT ATTATTG C C CGTGAAT CAAATGGTAAT CCTAATGTTG CTAATG CCT CAGGAG CT 
TCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAA 

AATTCAG CTATTAAAG CTTAT CGTGCTCAAGGTTTATCAG CTTGGGGTTAC 

SEQ ID NO. 5904 

STRAIN 18RS21 reverse complement 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCA^ 

G CGTAG CCAAAAAAT CAAAAATG ACT AAGG CGACATCTAAATCAAAAGTAGAAGATGTAA. 
AACAGG CT CCAAAAC CTTCT CAGG CATCTAATGAAGCCCCAAAAT CAAGTT CTCAATCTA 
CAGAAG CT AATTCTCAG CAACAAGTTACTG CGAGTGAAGAGG CAG CTGTAG AACAAG CAG 
TTGTAACAGAAAACAC C CCTG CTAC CAGTCAGG CACAACAAGCTTATG CTGTTACTGAGA 
CAACTTATAG ACCTG CT CAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATACTG 
CAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCT 
CTACITGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGOT 
CAGGAG CTTCAGGACTTTTC CAAACGATGCCAGGTTGGGG CAGG 
ATCAAGTT AATTCAG CTATT AAAGCTTATCGTG CT CAAGGTTTAT CAG CTTGGGGTTAC 

SEQ ID NO. 5905 

STRAIN 090 reverse complement 

TAG C CAAAAAAT CAAAAATG ATTAAGG CGACAT CTAAATCAAAAGTAGAAGATGTAAAAC 
AGG CT C CAAAA.CCTT CTCAGGCAT CTAATGAAG CC CCAAAAT CAAGTT CT CAATCTACAG 
AAG CTAATT CT CAG CAACAAGTT ACTGCGAGTGAAG AGGCAG CTGTAGAACAAG CAGTTG 
TAACAGAAAACAC CC CTG CTAC CAGT CAGG CACA^.CAAG CTTAT G CTGTTACTGAGACAA 
CTTATAGAC CTGCTCAACACCAGACGAGTGG C CAAGTATTGAGTAATGGAAATACTG CAG 
GGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAAC^ 

CTTGGGAACATATTATTGCC CGTGAAT CAAATGGTAAT C CT AATGTTG CTAATGC CTCAG 
GAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAA 

SEQ ID NO. 5906 

STRAIN A909 reverse complement 

AAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCA
TCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTT
ACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACC
AGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAG
ACAAGTGGCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCA
GCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGT
GAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACG
ATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGAATCAAGTTAATTCAGCTATTAAAGCT
TATCGTGCTCAAGGTTTATCA 

SEQ ID NO. 5907 

STRAIN CJB110 reverse complement

AATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGA
CATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATG
AAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTA^TTCT 

GTG AAG AGG CAG CTGTAG AACAAG CAGTTGT AACAG AAAACAC CCCTG CT ACCAGT CAGG 

CACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACA 

G C CAAGT ATTG AGTAATGGAAATACTGCAGGGG CT ATTGG CT CAG CAG CTG CAG CACAAA 

TGGCTGCTGCAACAGGAGTCCCrCAGTCTACTTGGGAACATATTATTGCCC 

ATGGT AAT C CTAATGT TG CTAATG CCTCAGG AG CTT CAGG ACTTTT C CAAACGATG C CAG 

GTTGGGGTT CAACAG CTACAGTT CAGGAT CAAGTTAATTCAG CTATTAAAG CTTAT CGTG 

CTCAAGGTTTATCAG CTTGGGGTTAC 

SEQ ID NO. 5908 

STRAIN COH1 reverse complement 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAA 

AGTT CG CGTAG CCAAAAAAT CAAAAATGACTAAGGCGACAT CT AAAT CAAAAGTAG AAGA 

TGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCT 

AT CTACAGAAGCTAATTCT CAG CAACAAGTT ACTG CGAGTG AAGAGGCGG CTGTAGAACA 

AG CAGTTGTAACAGAAAATAC CC CTG CT AC CAGTCAGG CACAACAAACTT ATG CTGTTAC 

TGAGACAACTTACAAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGA 

TACTGCAGGGGCGGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGC^ 

T CAGT CTACTT GGGAACATATT ATTG C C CGTGAAT CAAATGGTAAT CCTAATGTTG CT AA 

TGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAA 

T CAGGATCAAGTTAATT CAG CTATTAAAG CTTATCGTG CT CAAGGTTTATCAG CTTGGGG 

TTAC 



SEQ ID NO. 5909

STRAIN H36B reverse complement 

AAAAGTT CACAAGTT ACTACTGAATCTTTGT CAAAAGC 

AGATAAAGTTCG CGTAGC CAAAAAAT CAAAAATGACTAAGG CGACATCTAAAT CAAAAGT 

AGAAGATGTAAAACAGGCTC CAAAACCTTCT CAGGCAT CTAATGAAGCCC CAAAAT CAAG 

TTCTCAAT CT ACAGAAG CTAATT CT CAG CAACAAGTTACT G CG AGTGAAG AGG CAG CTGT 

AGAACAAG CAGTTGTAACAGAAAACACCCCTG CTAC CAGT CAGGCACAACAAG CTTATGC 

TGTTACTGAGACAACTTATAGAC CTG CTCAACAC CAGACAAGTGG CCAAGTATTGAGT AA 

TGGAAATAOTGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTG 

AGT CC CTCAGT CT ACTTGGG AACATATTATTGC CCGTGAAT CAAATGGTAAT CCTAATGT 

TGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAAC 

TACAGTT CAGGAT CAAGTTAATT CAG CTATTAAAG CTT 



SEQ ID NO. 5910

STRAIN M732 reverse complement 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGC
CAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGC
TCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGC
TAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAAC
AGAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA
CAAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAATACTGCAGGGGC
GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCT
GGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGC
TTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAGT
TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCA 

SEQ ID NO. 5911

STRAIN M781 reverse complement 

TCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACA
TCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAA
GCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGT
GAAGAGGCGGCTGTAGAACAAGCAGTTGTAACAGAAAATACCCCTGCTACCAGTCAGGCA
CAACAAACTTATG CTGTT ACTGAGACAACTTACAAACOT CAGACAAGTGGC
CAAGTATTGAGCAATGGAAATACTGCAGGGGCGGTCGGATCTGCTGCTGCAGCACAAATG
GCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAAAT
GGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATG
TGGGGTTCAACAGCTACAGTTCAGGATCAAGTTAATTCAGCTATTAAAGCTTATCGTC 

CAAGGTTT AT CAG CTTGGGGTTAC 
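
The entries above are labelled "reverse complement", i.e. the listed strand is the opposite strand of the assembled read, written 5' to 3'. A minimal sketch of that operation follows, assuming plain A/C/G/T input; the function name `reverse_complement` is hypothetical and is not taken from the listing, and any other character (ambiguity codes, scan debris) is passed through unchanged.

```python
# Complement table for unambiguous DNA bases only. Characters outside
# this set (N, '^', and so on) are left as they are.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string, ignoring whitespace."""
    cleaned = "".join(seq.split())          # drop spaces and line breaks
    return cleaned.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Last line of the M781 entry above, with the scan spacing removed.
    print(reverse_complement("CAAGGTTTATCAGCTTGGGGTTAC"))
```

Applying the function twice returns the cleaned input, which is a quick sanity check when comparing a listed strand against its source read.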

PRETTY of: /biotmp/msa519780.2{*}  March 10, 2003 06:25 ..

                           1                                                   50
msa519780.2{25_COH1}
msa519780.2{25_M781}
msa519780.2{25_M732}
msa519780.2{25_1169NT}
msa519780.2{25_18RS21}
msa519780.2{25_A909}
msa519780.2{25_090}
msa519780.2{25_CJB110}
msa519780.2{2603}          atgaataaaa gaagaaaatt atcaaaattg aatgtaaaaa aacatcattt
msa519780.2{25_H36B}
msa519780.2{25_JM9130013}
Consensus                  ********** ********** ********** ********** **********



                           51                                                 100
msa519780.2{25_COH1}
msa519780.2{25_M781}
msa519780.2{25_M732}
msa519780.2{25_1169NT}
msa519780.2{25_18RS21}
msa519780.2{25_A909}
msa519780.2{25_090}
msa519780.2{25_CJB110}
msa519780.2{2603}          agcttatgga gctatcactt tagtagccct tttttcatgt attttggctg
msa519780.2{25_H36B}
msa519780.2{25_JM9130013}
Consensus                  ********** ********** ********** ********** **********

                           101                                                150
msa519780.2{25_COH1}                      aaaagt tcacaagtta ctactgaatc tttgtcaaaa
msa519780.2{25_M781}                                                tc tttgtcaaaa
msa519780.2{25_M732}                      aaaagt tcacaagtta ctactgaatc tttgtcaaaa
msa519780.2{25_1169NT}                    aaaagt tcacaagtta ctactgaatc tttgtcaaaa
msa519780.2{25_18RS21}                    aaaagt tcacaagtta ctactgaatc tttgtcaaaa
msa519780.2{25_A909}
msa519780.2{25_090}
msa519780.2{25_CJB110}                                            aatc tttgtcaaaa
msa519780.2{2603}          taatggtcat ctttaaaagt tcacaagtta ctactgaatc tttgtcaaaa
msa519780.2{25_H36B}                      aaaagt tcacaagtta ctactgaatc tttgtcaaaa
msa519780.2{25_JM9130013}                 aaaagt tcacaagtta ctactgaatc tttgtcaaaa
Consensus                  ********** ****



                           151                                                200
msa519780.2{25_COH1}       gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_M781}       gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_M732}       gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_1169NT}     gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_18RS21}     gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_A909}                                                 A AGGCgACATC
msa519780.2{25_090}                         tagc caaaaaatca aaaatgattA AGGCgACATC
msa519780.2{25_CJB110}     gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{2603}          gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_H36B}       gcagataaag ttcgcgtagc caaaaaatca aaaatgactA AGGCgACATC
msa519780.2{25_JM9130013}  gcagataaag ttcgcgtagc caaaaaatca aaaatgaatA AGGCaACATC
Consensus                                                           .* ****-*****



                           201                                                250
msa519780.2{25_COH1}       TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_M781}       TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_M732}       TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_1169NT}     TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_18RS21}     TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_A909}       TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_090}        TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_CJB110}     TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{2603}          TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_H36B}       TAAATCAAAA GTAGAAGaTG TAAAACAGGC TCCAAAACct tctcaggcat
msa519780.2{25_JM9130013}  TAAATCAAAA GTAGAAGgTG TAAAACAGGC TCCAAAAC..
Consensus                  ********** *******-** ********** ********--


                           251                                                300
msa519780.2{25_COH1}       ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_M781}       ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_M732}       ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_1169NT}     ctaatgaagt cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_18RS21}     ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_A909}       ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_090}        ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_CJB110}     ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{2603}          ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_H36B}       ctaatgaagc cccaaaatCA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
msa519780.2{25_JM9130013}                     CA AGTTCTCAAT CTACAGAAGC TAATTCTCAG
Consensus                                     ** ********** ********** **********



                           301                                                350
msa519780.2{25_COH1}       CAACAAGTTA CTGCGAGTGA AGAGGCgGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_M781}       CAACAAGTTA CTGCGAGTGA AGAGGCgGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_M732}       CAACAAGTTA CTGCGAGTGA AGAGGCgGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_1169NT}     CAACAAGTTA CTGCGAGTGA AGAGGCgGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_18RS21}     CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_A909}       CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_090}        CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_CJB110}     CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{2603}          CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_H36B}       CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
msa519780.2{25_JM9130013}  CAACAAGTTA CTGCGAGTGA AGAGGCaGCT GTAGAACAAG CAGTTGTAAC
Consensus                  ********** ********** ******-*** ********** **********

                           351                                                400
msa519780.2{25_COH1}       AGAAAAtACC CCTGCTACCA GTCAgGCACA ACAAaCTTAT GCTGTTACTG
msa519780.2{25_M781}       AGAAAAtACC CCTGCTACCA GTCAgGCACA ACAAaCTTAT GCTGTTACTG






WO 2004/018646 



PCT/US2003/026827 



Table 59: Comparative Sequences relating to SAG2147 



                           351                                                400
msa519780.2{25_M732}       AGAAAAtACC CCTGCTACCA GTCAgGCACA ACAAaCTTAT GCTGTTACTG
msa519780.2{25_1169NT}     AGAAAAtACC CCTGCTACCA GTCAgGCACA ACAAaCTTAT GCTGTTACTG
msa519780.2{25_18RS21}     AGAAAAcACC CCTGCTACCA GTCAgGCACA ACAAgCTTAT GCTGTTACTG
msa519780.2{25_A909}       AGAAAAcACC CCTGCTACCA GTCAgGCACA ACAAgCTTAT GCTGTTACTG
msa519780.2{25_090}        AGAAAAcACC CCTGCTACCA GTCAgGCACA ACAAgCTTAT GCTGTTACTG
msa519780.2{25_CJB110}     AGAAAAcACC CCTGCTACCA GTCAgGCACA ACAAgCTTAT GCTGTTACTG
msa519780.2{2603}          AGAAAAcACC CCTGCTACCA GTCAgGCACA ACAAgCTTAT GCTGTTACTG
msa519780.2{25_H36B}       AGAAAAcACC CCTGCTACCA GTCAgGCACA ACAAgCTTAT GCTGTTACTG
msa519780.2{25_JM9130013}  AGAAAAtACC CCTGCTACCA GTCAaGCACA ACAAgCTTAT GCTGTTACTG
Consensus                             ********** ****-***** ****-***** **********

                           401                                                450
msa519780.2{25_COH1}       AGACAACTTA cAaACCTGCT CAACACCAGa CaAGTGGCCA AGTATTGAGc
msa519780.2{25_M781}       AGACAACTTA cAaACCTGCT CAACACCAGa CaAGTGGCCA AGTATTGAGc
msa519780.2{25_M732}       AGACAACTTA cAaACCTGCT CAACACCAGa CaAGTGGCCA AGTATTGAGc
msa519780.2{25_1169NT}     AGACAACTTA cAaACCTGCT CAACACCAGa CaAGTGGCCA AGTATTGAGc
msa519780.2{25_18RS21}     AGACAACTTA tAgACCTGCT CAACACCAGa CgAGTGGCCA AGTATTGAGt
msa519780.2{25_A909}       AGACAACTTA tAgACCTGCT CAACACCAGa CaAGTGGCCA AGTATTGAGt
msa519780.2{25_090}        AGACAACTTA tAgACCTGCT CAACACCAGa CgAGTGGCCA AGTATTGAGt
msa519780.2{25_CJB110}     AGACAACTTA tAgACCTGCT CAACACCAGa CgAGTGGCCA AGTATTGAGt
msa519780.2{2603}          AGACAACTTA tAgACCTGCT CAACACCAGa CgAGTGGCCA AGTATTGAGt
msa519780.2{25_H36B}       AGACAACTTA tAgACCTGCT CAACACCAGa CaAGTGGCCA AGTATTGAGt
msa519780.2{25_JM9130013}  AGACAACTTA tAgACCTGCT CAACACCAGc CgAGTGGCCA AGTATTGAGc
Consensus                  ********** -*-******* *********- *-******** *********-

                           451                                                500
msa519780.2{25_COH1}       AATGGAAATA CTGCAGGGGc ggTcGGaTCt GCtGCtGCAG CACAAATGGC
msa519780.2{25_M781}       AATGGAAATA CTGCAGGGGc ggTcGGaTCt GCtGCtGCAG CACAAATGGC
msa519780.2{25_M732}       AATGGAAATA CTGCAGGGGc ggTcGGaTCt GCtGCtGCAG CACAAATGGC
msa519780.2{25_1169NT}     AATGGAAATA CTGCAGGGGc ggTcGGaTCt GCtGCtGCAG CACAAATGGC
msa519780.2{25_18RS21}     AATGGAAATA CTGCAGGGGc taTtGGcTCa GCaGCtGCAG CACAAATGGC
msa519780.2{25_A909}       AATGGAAATA CTGCAGGGGc taTtGGcTCa GCaGCtGCAG CACAAATGGC
msa519780.2{25_090}        AATGGAAATA CTGCAGGGGc taTtGGcTCa GCaGCtGCAG CACAAATGGC
msa519780.2{25_CJB110}     AATGGAAATA CTGCAGGGGc taTtGGcTCa GCaGCtGCAG CACAAATGGC
msa519780.2{2603}          AATGGAAATA CTGCAGGGGc taTtGGcTCa GCaGCtGCAG CACAAATGGC
msa519780.2{25_H36B}       AATGGAAATA CTGCAGGGGc taTtGGcTCa GCaGCtGCAG CACAAATGGC
msa519780.2{25_JM9130013}  AATGGAAATA CTGCAGGGGt taTtGGcTCa GCaGCaGCAG CACAAATGGC
Consensus                  ********** *********- --*-**-**- **-**-**** **********

                           501                                                550
msa519780.2{25_COH1}       TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_M781}       TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_M732}       TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_1169NT}     TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_18RS21}     TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_A909}       TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_090}        TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_CJB110}     TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{2603}          TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_H36B}       TGCTGCAACa GGAGTcCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
msa519780.2{25_JM9130013}  TGCTGCAACg GGAGTtCCTC AGTCTACTTG GGAACATATT ATTGCCCGTG
Consensus                  *********- *****-**** ********** ********** **********

                           551                                                600
msa519780.2{25_COH1}       AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_M781}       AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_M732}       AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_1169NT}     AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_18RS21}     AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_A909}       AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_090}        AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_CJB110}     AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{2603}          AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_H36B}       AATCAAATGG TAATCCTAAt GTTGCTAATG CCTCAGGAGC TTCAGGACTT
msa519780.2{25_JM9130013}  AATCAAATGG TAATCCTAAc GTTGCTAATG CCTCAGGAGC TTCAGGACTT
Consensus                  ********** *********- ********** ********** **********

                           601                                                650
msa519780.2{25_COH1}       TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_M781}       TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_M732}       TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_1169NT}     TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_18RS21}     TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_A909}       TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGaAtcaagt
msa519780.2{25_090}        TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgA
msa519780.2{25_CJB110}     TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{2603}          TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_H36B}       TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
msa519780.2{25_JM9130013}  TTCCAAACGA TGCCAGGTTG GGGTTCAACA GCTACAGTTC AGgAtcaagt
Consensus                  ********** ********** ********** ********** **-*

                           651                                                700
msa519780.2{25_COH1}       taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{25_M781}       taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{25_M732}       taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{25_1169NT}     taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{25_18RS21}     taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{25_A909}       taattcagct attaaagctt atcgtgctca aggtttatca
msa519780.2{25_090}
msa519780.2{25_CJB110}     taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{2603}          taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
msa519780.2{25_H36B}       taattcagct attaaagctt
msa519780.2{25_JM9130013}  taattcagct attaaagctt atcgtgctca aggtttatca gcttggggtt
Consensus

                           701
msa519780.2{25_COH1}       ac
msa519780.2{25_M781}       ac
msa519780.2{25_M732}
msa519780.2{25_1169NT}     ac
msa519780.2{25_18RS21}     ac
msa519780.2{25_A909}
msa519780.2{25_090}
msa519780.2{25_CJB110}     ac
msa519780.2{2603}          actag
msa519780.2{25_H36B}
msa519780.2{25_JM9130013}  ac
Consensus



SEQ ID NO. 5912 
STRAIN 2603 frame: 1 

MNKRRKLSKLNVKKHHLAYGAITLVALFSCILAVMVIFKSSQVTTESLSKADKVRVAKKS
KMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEA^

PATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQMAAATGVPQSTWEHI
IARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRAQGLSAWGY 

SEQ ID NO. 5913 
STRAIN 1169NT frame: 1

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEVPKSSSQSTEAN
SQQQVTASEEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY

SEQ ID NO. 5914 
STRAIN 18RS21 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY

SEQ ID NO. 5915 
STRAIN 2603 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVE 

SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY

SEQ ID NO. 5916 

STRAIN 090 frame: 3 

AKKSKMIKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTASEEAAVEQAVV

TENTPATSQAQQAYAVTETTYRPAQHQTSGQVL

WEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQ

SEQ ID NO. 5917 

STRAIN A909 frame: 1 

KATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTASEEAAVEQAVVTENTPAT
SQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQMAAATGVPQSTWEHIIAR
ESNGNPNVANASGASGLFQTMPGWGSTATVQNQVNSAIKAYRAQGLS 

SEQ ID NO. 5918 
STRAIN CJB110 frame: 3

SLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQ

EEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAIGSAAAAQM
AAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRA
QGLSAWGY 

SEQ ID NO. 5919 
STRAIN COH1 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKV^

SQQQVTASEEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWGY

SEQ ID NO. 5920 
STRAIN H36B frame: 1

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKA



SEQ ID NO. 5921 
STRAIN M732 frame: 1

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN
SQQQVTASEEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV
GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN
SAIKAYRAQGLSAWG

SEQ ID NO. 5922 
STRAIN M781 frame: 4 

SLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTAS
EEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAVGSAAAAQM
AAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRA
QGLSAWGY 

SEQ ID NO. 5923 
STRAIN JM9130013 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMNKATSKSKVEGVKQAPKPSSQSTEANSQQQVTASEE
AAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQPSGQVLSNGNTAGVIGSAAAAQMAA
ATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRAQG
LSAWGY 
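
The amino acid entries above are labelled with a reading frame ("frame: 1", "frame: 3"), meaning the nucleotide sequence is read in codons starting at that offset. A minimal sketch of forward-frame translation follows, assuming the standard genetic code and frames 1 to 3 only; the name `translate` and the `X` placeholder for unreadable codons are illustrative choices, not part of the listing, and frames on the opposite strand (such as the "frame: 4" entry) are not handled here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 1) -> str:
    """Translate a DNA string in forward frame 1, 2 or 3.

    Whitespace is ignored and case is folded. Codons containing anything
    other than A/C/G/T are reported as 'X'. A trailing partial codon is
    dropped.
    """
    if frame not in (1, 2, 3):
        raise ValueError("only forward frames 1-3 are handled in this sketch")
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(frame - 1, len(seq) - 2, 3):
        residues.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the strain 2603 row in the nucleotide alignment.
    print(translate("atgaataaaa gaagaaaatt atcaaaattg"))
```

Run on the first thirty bases of the strain 2603 row, the sketch reproduces the opening residues of SEQ ID NO. 5912.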

MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa519418.2{*}  March 10, 2003 06:15

                           1                                                   50
msa519418.2{25_090}
msa519418.2{25_H36B}                                                KS SQVTTESLSK
msa519418.2{25_COH1}                                                KS SQVTTESLSK
msa519418.2{25_M781}                                                         SLSK
msa519418.2{25_1169NT}                                              KS SQVTTESLSK
msa519418.2{25_M732}                                                KS SQVTTESLSK
msa519418.2{25_18RS21}                                              KS SQVTTESLSK
msa519418.2{25_CJB110}                                                       SLSK
msa519418.2{25_2603}                                                KS SQVTTESLSK
msa519418.2{2603}          mnkrrklskl nvkkhhlayg aitlvalfsc ilavmvifKS SQVTTESLSK
msa519418.2{25_A909}
msa519418.2{25_JM9130013}                                           KS SQVTTESLSK
Consensus                  ********** ********** ********** ********** **********

                           51                                                 100
msa519418.2{25_090}              akks kmiKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_H36B}       ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_COH1}       ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_M781}       ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_1169NT}     ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasnevpks SSQSTEANSQ
msa519418.2{25_M732}       ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_18RS21}     ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_CJB110}     ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_2603}       ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{2603}          ADKVRVakks kmtKATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_A909}                     KATSKSK VEdVKQAPKP sqasneapks SSQSTEANSQ
msa519418.2{25_JM9130013}  ADKVRVakks kmnKATSKSK VEgVKQAPKP            SSQSTEANSQ
Consensus                  ******        ******* **-*******            **********

                           101                                                150
msa519418.2{25_090}        QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{25_H36B}       QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{25_COH1}       QQVTASEEAA VEQAVVTENT PATSQAQQtY AVTETTYkPA QHQtSGQVLS
msa519418.2{25_M781}       QQVTASEEAA VEQAVVTENT PATSQAQQtY AVTETTYkPA QHQtSGQVLS
msa519418.2{25_1169NT}     QQVTASEEAA VEQAVVTENT PATSQAQQtY AVTETTYkPA QHQtSGQVLS
msa519418.2{25_M732}       QQVTASEEAA VEQAVVTENT PATSQAQQtY AVTETTYkPA QHQtSGQVLS
msa519418.2{25_18RS21}     QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{25_CJB110}     QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{25_2603}       QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{2603}          QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{25_A909}       QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQtSGQVLS
msa519418.2{25_JM9130013}  QQVTASEEAA VEQAVVTENT PATSQAQQaY AVTETTYrPA QHQpSGQVLS
Consensus                  ********** ********** ********-* *******-** ***-******

                           151                                                200
msa519418.2{25_090}        NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_H36B}       NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_COH1}       NGNTAGavGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_M781}       NGNTAGavGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_1169NT}     NGNTAGavGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_M732}       NGNTAGavGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_18RS21}     NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_CJB110}     NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_2603}       NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{2603}          NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_A909}       NGNTAGaiGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
msa519418.2{25_JM9130013}  NGNTAGviGS AAAAQMAAAT GVPQSTWEHI IARESNGNPN VANASGASGL
Consensus                  ******--** ********** ********** ********** **********

                           201                             234
msa519418.2{25_090}        FQTMPGWGST ATVQ
msa519418.2{25_H36B}       FQTMPGWGST ATVQDQVNSA IKA
msa519418.2{25_COH1}       FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{25_M781}       FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{25_1169NT}     FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{25_M732}       FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWG-
msa519418.2{25_18RS21}     FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{25_CJB110}     FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{25_2603}       FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{2603}          FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
msa519418.2{25_A909}       FQTMPGWGST ATVQnQVNSA IKAYRAQGLS
msa519418.2{25_JM9130013}  FQTMPGWGST ATVQDQVNSA IKAYRAQGLS AWGY
Consensus                  ********** ********** ********** ****

Table 60: Comparative Sequences relating to SAG1945



SEQ ID NO. 6001
STRAIN 2603 

ATGAAAGAAAAACAGTCGAAAAGGCTTATTTATATACTACTGGTTGTTC 

ATAAGTGTTTTTACATACAGTATTAGCCaGCCTTCTAAACTACTTCCA 

GTTATTCTAAGTCGAAATAGTCAAGCCATTTTAACAGGAACGA 

AAA.TACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGCAACTAATAGATAGATTA 
AGTAAGGAGGGTAAG CAGTTGAAGGCGGATATTTTCTTTG GAGG AAATTATACG CAATTT 
GAAAGTCAT AAGGCATTGTTTGAGT CTT ACGTAT CAAAGAATGTT CATACTGTTATT C CA 
GACTATATCCATCCAAGTGATACGGCGACACCITATACTATAAATGGGAGTGTCTTGATT 
GTAAATAACGAATTAG CTAAGGGAC TTAC CAT CAAGAGTTATGAAGATTT ATTACAG C CT 
TCCTTAAAAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCTCTAGTGCTT^ 
CTCACTAATATACTCTTGGCCAAGGGTGGTTACACCAATCCAAAAGCGTGGAACTATGTT 
AAAAAG CTACAACATAATATT AATGCT AT CAAAT CTTCT AGCT CTT CAGAAGTTT AT CAA 
TCAGTTGCAGAAGGAAAAATGATTGTGGGGCTGACTTAC^ 

CAAAAAAGTGGTGC CAATGTTT CTATT GT ATAT C CGACAGAAGGG ACAGTTTTTGTCC CA 
TCTT CGGTTG CAATTATAAAGAATG CT CCTTCTATGAAAGAAG CAAAGTTATTTATT AAT 
TTTATGCTTTCTTTAGATGTTGAAAATGCCT 

CGTAAAGATG CCCAAACGAGTAATGG CATGAAAG CITTAAAGGATATTGCTACT CTTAAA 
GAAGATTAT CGCTATGT CACTAAG CATAAGGG C CAAAT C CTTAAAAC CTATAAT CGTATT 
CGTAGAAATG CTGAT 

SEQ ID NO. 6002 
STRAIN 090 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAGTTATTCTAAGT 
CCMU^TAGTCAAGCCATTTTAACAGGAACGATTCCAGCTTTTGA 
ATACGGTATAAAAGTTAAG CTT ATT CAAGGTGGG ACAGGG CAACTAATAG 
ATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATATTTTCTTTGGA 
GGAAATTATACX3CAATITGAAAGTCATAAGGCATO 

AT CAAAGAAT GTT CATACTGTTATT CCAGACTATATCCATCCAAGTGATA 

CGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGTA^TAACGAA 

TTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTATTA 

CTTAAAAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCT 

TCTCACAACTCACTAATATACTCTTGGCCAA 

AAAG CGTGG AACT AT GTT AAAAAG CTACAACATAATATT AATGCT AT CAA 

ATCTT CTAG CTCTTCAGAAGTTTAT CAATCAGTTGCAGAAGGAAAAATGA 

TTGTGGGGCrGACTTACGAAGACCCTAGTGTCAATTTGCAAAAA^ 

GCCAATGTTTCTATTGTATATCCGACAGAAGGCACAGTTTT^ 

TT CGGTTGCAATTATAAAGAATGCTC CTTCTATGAAAGAAG CAAAGTTAT 

TTATTAATTTTATGCTT t CTTTAgATGTTCAAAATGCCTTTGGGCAGTCA 

ACGAGTAACCGACCTATT CGTAAAGATGC C CAAACGAGTAATGG CATGAA 

AG CTTTAAAGGATATTGCT ACT CTTAAAG AAGATT ATCGCTATGT CACT A 

AG CATAAGGGCCAAATCCTTAAAACCTATAAT CGTATTCGTAGAAATG CT 

GAT 

SEQ ID NO. 6003
STRAIN A909 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAG 

TTATTCTAAGTCCAAATAGTCAAGCCATTTTAACAGGAACGATTCCAGCT 
TTTGAGGAAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGG 
TCAACrAATAGATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATA 
TTTT CTTTGGAGGAAATTATACGCAATTTGAAAGTCATAAGG CATTGTTT 
GAGTCTTACGTATCAAAGAATATT CATACTGTTATTC CAGATTATATCCA 
TCCGAGTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTG 
TAAATAACX3AATTAGCTAAGGGACTTACCAT 

TT ACAG CCTTCCTT AAAAGGT AAAATT GCCTTTGCAGAT C CGAATACTT C 
CTCTAGTGCTTTCrCACAACrCACrAATATACT 

ACAC CAATC CAAAAGCGTGGAACTATGTTAAAAAG CTACAACATAATATT 
AATG CTATCAAAT CTT CTAG CT CTTCAGAAGTTTATCAAT CAGTTG CAGA 
AGGAAAAATGATTGTGGGGTTGACTTACGAAGACCCTAGTGTCAATTTGC 
AAAAAAGTGGTGCCAATGTTTCTATTGTATATCCGACAGAAGGGACAGTT 
TTTGTCC CATCTT CGGTTG CAATTATAAAGAATG CTC CTTCT ATGAAAGA 
AG CAAAGTTATTTATT AATTTTATGCTTT CHTTAGAT CAAAATG C CT 
TTGGGCAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACGAGT 
AATGGCATGAAAGCTTTAAAGGATATTGCTACTCTTAAAGAAG 
CTATGTCACTAAGCATAAGGG CCAAATCCTTAAAACCTATAAT CGTATTC 
GTAGAAATG CTGAT 

SEQ ID NO. 6004

STRAIN H36B 

TAAACTACTT CCAC CAAAAGAATTAGTTATT CTAAGT CCAAATAGTCAAG 
CCATTTTAACAGGAACG ATT CCAG CTTTTGAGGAAAAATACGGT ATAAAA 
GTTAAGCTTATTCAAGGTGGGACAGGTCAACTAATAGATAGATTAAGT 
GGAGGGTAAG CAGTTGAAGG CGGATATTTT CTTTGGAGGAAATTATACG C 
AATTTGAAAGTCAT AAGGCATTGTTTG AGT CTT ACGTAT CAAAGAATATT 
CATACTGTTATTC CAGATTATATCCAT C CGAGTGATACGG CGACACCTTA 
TACTATAAATGGGAGTGTCTTGATTGTAAATAAcGAATTAGTTAAGGGAC 
TTAC CATCAAGAGTTATGAAGATTTATTACAG CCTTCCTTAAAAGGTAAA 
ATTG CCTTTGCAGATCCGAATACTT C CTC TAGTG CTTT CT CACAACT CAC 
TAATATACT CTTGG C CAAGGGTGGTTACAC CAAT CCAAAAGCGTGGAACT 
ATGTTAAAAAG CTACAACAT AATATTAATG CTAT CAAAT CTTCTAG CTCT 
TCAGAAGTTT AT CAAT CAGTTGCAGAAGGAAAAATGATTGTGGGGTTGAC 
TTACGAAGACCCTAGTGTCAATTTGCAAAAAAGTGGTGCC^ 
TTGTATATCCGACAGAAGGGACAGTTTTTGTCCCATCTTCGGTTGCAATT
ATAAAGAATGCTCCTTCTATGAAAGAAGCAAAGTTATTTATTAATTTTAT
GCTTTCTTTAGATGTTCAAAATGCCTTTGGGCAGT 

CTATT CGTAAAGATG C C CAAACGAGTAATGG CATGAAAG CTTTAAAG GAT 
ATTG CTACTT CTT AAAG AAG ATTAT C G CTATGTCA C TAAG CAT AAG GG C C A 
AATC CTTAAAACCT AT AAT CGTATT CGTAG AAATG CTGAT 

SEQ ID NO. 6005 
STRAIN 18RS21 

C^GCCTT CT AAACT ACTT C CAC CAAAAGAATTAGT TATT CTAAGT CCAAA 
TAGTCAAGCCATTTTAACAGGAACGATTCCAGCTTTTGAGGAAAAATACG 
GTATAAAAGTTAAG CTT ATT CAAGGTGGGACAGGG CAACTAATAG ATAG A 
TTAAGTAAGGAG GGTAAG CAGTTG AAGGCGgATATTTTCTTT GG AGGAAA 
TTATACG CAATTTGAAAGTCAT AAGGCATTGTTTGAGT CTTACGTAT CAA 
AGAATGTTCATACTGTTATTCCAGACTATATCCATCCAAGTGATACGGCG 
ACACCrTTATACTATAAATGGGAGTGTt^GATTGTAAATAACGAATTAGC 
TAAGGGACTTACCAT CAAG AGTTATGAAGATTTATTACAG CCTT C CTTAA 
AAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCTCTAGTGCTTTCT 
CAACT CACT AATATACT CTTGG CCAAGGGTGGTTACAC CAAT CCAAAAG C 
GTGGAACTATGTTAAAAAG CTACAACATAATATTAATG CTAT CAAAT CTT 
CTAGCTCTTCAGAAGTTTAT CAATCAGTTGCAGAAGGAAAAATGATTGTG 
GGGCTGACTTACGAAGACCCTAGTGTCAATTT^GCAAAAA 
TGTTTCTATTGTAT ATC CG ACAGAAGOGACAGTTTTTGT C CCAT CTT CGG 
TTGCAATTATAAAGAATGCT C CTTCTATGAAAGAAGCAAAGTTATTTATT 
AATTTTATG CTTTCTTTAGATGTTCAAAATGC CTTTGGG CAGT CAACGAG 
TAACCGACCTATTCGTAAAGATGCC CAAACGAGTAATGG CATGAAAG CTT 
TAAAGGATATTG CTACT CTTAAAGAAGATTAT CG CTATGTCACTAAGCAT 
AAGGG CCAAATC CTTAAAAC CTATAAT CGTATTCGTAGAAATG CTGAT 

SEQ ID NO. 6006 
STRAIN M732 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAGT 

TATTCTAAGTCCAAATAGTCAAGCC^TTTTAACAGGAACGATTCCAGCTT 
TTGAGGAAAAATACGGTATAAAAGTTAAGCTTATT CAAGGTG GGACAGGG 
CAACTAATAGAT AGATT AAGTAAGG AGGGTAAG CAGTTG AAGG CG GATAT 
TTTCTTTGGAG<3AAATTATACGCAATTTGAAAOT 

AGTCTTACGTAT CAAAGAATGTTCATACTGTT ATT CCAG ACTAT AT CCAT 

CCGAGTGATACGGCGACACCITATACTATAAATGG-GAGTGTCTTGATTGT 

AAATAACGAATTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTAT 

TACAGCCTTCCTTAAAAGGTAAAATTGCCTTTGCAGATCCGA 

TCTAGTCCTTTCTCACAACTCACTAATATACT 

CACCAATCCAAAAGCGTGGAACTATGTTAAAAAGCTACAACATAATATTA 

ATGCTATCAAAT CTT CTAG CT CTTCAGAAGTTTAT CAAT CAGTTG CAGAA 

GGAAAAATGATTGTGGGGTTGACTTACGAAGACCCTAGTGTCAATTTGCA 

AAAAAGTGGTGCCAATGTTTCTATTGTATACCCGACAGAAGGGACAGTTT 

TTGTCCCATCTTCG<3TTGCAATTATAAAGAATGCTCCTTCTATGAAAGAA 

GCAAAGTTATTTATTAATTTTATGCTTTCTTTAGATC 

TGGG CAGTCAACGAGTAACCGACCTATT CGTAAAGATG CC CAAACAAGTA 

ATGGCATGAAAGCTTTAAAGGATATCGCTACTCT^ 

TATGTC^CTAAGICATAAGAGCCAAATCCTTAAAACCTATAATCGCATTC 
TAGAAATGCTGAT 

SEQ ID NO. 6007 
STRAIN COH1 

CAGCCIT CTAAACTACTT C CACCAAAAGAATTAGTT 
ATTCTAAGTCCAAATAGTCAAGCCATTTTAACAGGA^ 
TGAGGAAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGC 
AACTAATAGATAGATTAAGTAAGGAGG<jTAAGCAGTTGAAGGCGGATATT 
TTCTTTGGAGGAAATTATACG CAATTTGAAAGT CATAAGG CATTGTTTG A 
GTCTTACGTATCAAAGtAATGTT CATACTGTTATT CCAGACTAT^ C 
CGAGTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGTA 
AATAACGAATTAG CTAAGGGACTTAC CAT CAAGAGTTATG AAGATTTATT 
ACAG C CTTCCTT AAAAGGTAAAATTG CCTTTG CAGAT CCGAATACTT CCT 
CTAGTGCTTTCTCACAACTCACTA 

AC CAATCCAAAAG CGTGGAACT ATGTTAAAAAGCTACAACATAAT ATTA^ 
TG CTATCAAATCTT CTAGCT CTT CAGAAGTTTAT CAAT CAGTTG CAG AAG 
GAAAAATGATTGTGGGGTTG ACTTACGAAG AC CCTAGTGT CAATTTG CAA 
AAAAGTGGTG CCAATGTTTCTATTGTATAC CCGACAGAAGGGACAGTTTT 
TGTCCCATCTTCGGTTG CAATTATAAAGAATGCT COTT CTATGAAAG AAG 
CAAAGTTATTTATTAATTTTATGCTTTLTl n i"lAGATGTT CAAAATG CCTTT 
GGGCAGTCAACG AGTAACCGACCT ATT CGT AAAGATGC C CAAACAAGTAA 
TGG CATGAAAGCTTTAAAGGATAT CGCTACT CTTAAAGAAGATTATCGCT 
ATGTCACTAAGCATAAGAGCCAAATCCTTAAAACCTATAATCGCATTCGT 
AGAAATG CTGAT 

SEQ ID NO. 6008 
STRAIN M781 

CAGCCTTCTAAACTACTTCCACCAAAAGA
CTAAGTCCAAATAGTCAAGCCATTTTAACAGGAACGATTCCAGCTTTTGA
GGAAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGCAAC
TAATAGATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATATTTTC
TTTGGAGGAAATTATACGCAATTTGAAAGTCATAAGGCATTGTTTGAGTC
TTACGTATCAAAGAATGTTCATACTGTTATTCCAGACTATATCCATCCGA
GTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGTAAAT
AACGAATTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTATTACA
GCCTTCCTTAAAAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCTCTA
GTGCTTTCTCACAACTCACTAATATACTCTTGGCCAAGGGTGGTTACACC
AATCCAAAAGCGTGGAACTATGTTAAAAAGCTACAACATAATATTAATGC
TATCAAATCTTCTAGCTCTTCAGAAGTTTATCAATCAGTTGCAGAAGGAA
AAATGATTGTGGGGTTGACTTACGAAGACCCTAGTGTCAATTTGCAAAAA
AGTGGTGCCAATGTTTCTATTGTATACCCGACAGAAGGGACAGTTTTTGT
CCCATCTTCGGTTGCAATTATAAAGAATGCTCCTTCTATGAAAGAAGCAA
AGTTATTTATTAATTTTATGCTTTCTTTAGATGTTCAAAATGCCTTTGGG
CAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACAAGTAATGG
CATGAAAGCTTTAAAGGATATCGCTACTCTTAAAGAAGATTATCGCTATG
TCACTTAAGCATAAGAGCCAAATCCTTAAAACCTATAATCGCATTCGTAGA
AATGCTGAT

SEQ ID NO. 6009 
STRAIN CJB110 

CAGCCTTTTAAACTACTTCCACCAAAAGAATTAGTTATTCT
AAGTCCAAATAGTCAAGCCATTTTAACAGGAACGATTCCAGCTTTTGAGG
AAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGCAACTA
ATAGATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATATTTTCTT
TGGAGG^yUVTTATACGCMTTTGAAAGTCATAAGGCA.TTGTTTGAGTCTT
ACGTATCAAAGAATGTTCATACTGTTATTCCAGACTATATCCATCCAAGT
GATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGTAAATAA
CGAATTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTATTACAGC
CTTCCTTAAAAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCTCTAGT
GCTTTCTCACAACTCACTAATATACTCTTGGCCAAGGGTGG CAA
TCCAAAAGCGTGGAACTATGTTAAAAAGCTACAACATAATATTAATGCTA
TCAAATCTTCTAGCTCTTCAGAAGTT^
ATGATTGTGGGGCTGACTTACGAAGACCCTAGTGTCAATTTGCAAAAAAG
TGGTGCCAATGTTTCTATTGTATATCCGACAGAAGGGACAGTTTTTGTCC
CATCTTTCGGTTGCAATTATAAAGAATGCTCCTTCTATGAAAGAAGCAAAG
TTATTTATTAATTTTATGCTTTCTTTAGATGTTCAAT^ATGCCTTTGGGCA
GTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACGAGTAATGGCA
TGAAAGCTTTTAAAGGATATTGCTACTCTTAAAGAAGATTATCGCTATGTC
ACTAAGCATAAGGGCCAAATCCTTAAAACCTATAATCGTATTCGTAGAAA
TGCTGAT

SEQ ID NO. 6010 
STRAIN 1169NT 

ATAGTCAAGCCATTTTAACAGGAACGATTCCAGCTTTTGAGGAAAAATAC
GGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGGCAACTAATAGATAG
ATTAAGTAAGGAGGGTAAGCATTTGAAGGCGGATATTTTCTTTGGAGGAA
ATTATACGCAATTTGAAAGTCATAAGGCATTGTTTGAGTCTTACGTATCA
AAGAATGTTCATACTGTTATTCCAGACTATATCCATCCAAGTGATACGGC
GACACCTTATACTATAAATGGGAGTGTCTTGATTGTAAATAACGAATTAG
CTAAGGGACTTACCATCAAGAGTTATGAAGATTTATT^
AAAGGTAAAATTGCCTTTGCAGATCCGAATACTTCCTCTAGTGCTTTCTC
ACAACTCACCAATATACTCTTC^CAAACtGGTGGTTA
CGTGGAACTATGTTAAAAAGCTACAACATAATATTAATGCTATCAAATCT
TCTAGCTCTTCAGAAGTTTATCAATCAGTTGCAGAAGGAAAAATGATTGT
GGGGTTGACTTACGAAGACCCTAGTGTCAATTTGCAAAAAAGTGGTGCCA
ATGTTTCTATTGTATATCCGACAGAAGGGACAGTTTTTGTCCCATCTTCG
GTTGCAATTATAAAGAATGCTCCTTCTATGAAAGAAGCAAAGTTATTTAT
TAATTTTATGCTTTCTTTAGATGTTCAAAATGCCTTTGGGCAGTCAACGA
GTAACCGACCTATTCGTAAAGATGCCCAAACGAGTAATGGCATGAAAGCT
TTAAAGGATATTGCTACTCTTAAAGAAGATTATCGCTATGTCACTAAGCA
TAAGGGCCAAATCCTTAAAACCTATAATCGTATTCGTAGAAATGCTGAT

SEQ ID NO. 6011 
STRAIN JM91 130013 

CAGCCTTCTAAACTACTTCCACCAAAAGAATTAGT
TATTCTAAGTCCAAATAGTCAAGCCATTTTAACAGGAACGATTCCAGCTT
TTGAGGAAAAATACGGTATAAAAGTTAAGCTTATTCAAGGTGGGACAGGG
CAACTAATAGATAGATTAAGTAAGGAGGGTAAGCAGTTGAAGGCGGATGT
TTTCTTTGGAGGAAATTATACGCAATTTGAAAGTCATAAGGCATTGTTTG
AGTCTTACGTATCAAAGAATGTTCATACTGTTATTCCAGACT
CCGAGTGATACGGCGACACCTTATACTATAAATGGGAGTGTCTTGATTGT
AAATAACGAATTAGCTAAGGGACTTACCATCAAGAGTTATGAAGATTTAT
TACAGCCTTCCTTAAAAGGTAAAATTGCCITTC
TCTAGTGCTTTCTCACAACTCACCAATATACTCTTGGCAAAGGGTGGTTA
CACCAATCCAAAAGCGTGGAACTATGTTAAAAAGCTACAACATAATATTA
ATGCTATCAAATCTTCTAGCTCTTCAGAAGTTTATCAATCAGTTGCAGAA
GGCAAAATGATTGTGGGGCTGACTTACGAAGkCCCT^
AAAAAGTGGTGCCAATGTTTCTATTGTGTATCCGACAGAAGGGACRGTTT
TTGTCCCATCTTCGGTTGCAATTATAAAGAATGCTCCTTCTA
GCAAAGTTATTTATTAATTTTATGCTTTCT^
TGGGCAGTCAACGAGTAACCGACCTATTCGTAAAGATGCCCAAACGAGTA
ATGGCATGAAAGCTTTAAAGGATATTGCTACTCTTAAAGAAGATTATCGC
TATGTCACTAAGCATAAGGGCCAAATCCTTAAAACCTATAATCGTATTCG
TAGAAATGCTGAT

PRETTY of: /biotmp/msa523010.2{*}  April 28, 2003 08:55

                              1                                                   50
msa523010.2{263_COH1}
msa523010.2{263_M732}
msa523010.2{263_M781}
msa523010.2{263_A909}
msa523010.2{263_H36B}
msa523010.2{263_090}
msa523010.2{263_18RS21}
msa523010.2{263_2603}         atgaaagaaa aacagtcgaa aaggcttatt tatatactac tggttgtttc
msa523010.2{263_CJB110}
msa523010.2{263_1169NT}
msa523010.2{263_JM91130013}
Consensus                     ********** ********** ********** ********** **********

                              51                                                 100
msa523010.2{263_COH1}                                                 cag ccttctaaac
msa523010.2{263_M732}                                                 cag ccttctaaac
msa523010.2{263_M781}                                                 cag ccttctaaac
msa523010.2{263_A909}                                                 cag ccttctaaac
msa523010.2{263_H36B}                                                          taaac
msa523010.2{263_090}                                                  cag ccttctaaac
msa523010.2{263_18RS21}                                               cag ccttctaaac
msa523010.2{263_2603}         cattattttt ataagtgttt ttacatacag tattagccag ccttctaaac
msa523010.2{263_CJB110}                                               cag ccttttaaac
msa523010.2{263_1169NT}
msa523010.2{263_JM91130013}                                           cag ccttctaaac
Consensus                     ********** ********** ********** ******* _



                              101                                                150
msa523010.2{263_COH1}         tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_M732}         tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_M781}         tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_A909}         tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_H36B}         tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_090}          tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_18RS21}       tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_2603}         tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_CJB110}       tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
msa523010.2{263_1169NT}                                              ATAG TCAAGCCATT
msa523010.2{263_JM91130013}   tacttccacc aaaagaatta gttattctaa gtccaaATAG TCAAGCCATT
Consensus                                                      ______**** **********



                              151                                                200
msa523010.2{263_COH1}         TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_M732}         TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_M781}         TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_A909}         TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_H36B}         TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_090}          TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_18RS21}       TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_2603}         TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_CJB110}       TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_1169NT}       TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
msa523010.2{263_JM91130013}   TTAACAGGAA CGATTCCAGC TTTTGAGGAA AAATACGGTA TAAAAGTTAA
Consensus                     ********** ********** ********** ********** **********



                              201                                                250
msa523010.2{263_COH1}         GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_M732}         GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_M781}         GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_A909}         GCTTATTCAA GGTGGGACAG GtCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_H36B}         GCTTATTCAA GGTGGGACAG GtCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_090}          GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_18RS21}       GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_2603}         GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_CJB110}       GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_1169NT}       GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
msa523010.2{263_JM91130013}   GCTTATTCAA GGTGGGACAG GgCAACTAAT AGATAGATTA AGTAAGGAGG
Consensus                     ********** ********** *_******** ********** **********



                              251                                                300
msa523010.2{263_COH1}         GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_M732}         GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_M781}         GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_A909}         GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_H36B}         GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_090}          GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_18RS21}       GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_2603}         GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_CJB110}       GTAAGCAgTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_1169NT}       GTAAGCAtTT GAAGGCGGAT aTTTTCTTTG GAGGAAATTA TACGCAATTT
msa523010.2{263_JM91130013}   GTAAGCAgTT GAAGGCGGAT gTTTTCTTTG GAGGAAATTA TACGCAATTT
Consensus                     *******_** ********** _********* ********** **********
Table 60: Comparative Sequences relating to SAG1945



                              301                                                350
msa523010.2{263_COH1}         GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_M732}         GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_M781}         GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_A909}         GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATaTTCATAC
msa523010.2{263_H36B}         GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATaTTCATAC
msa523010.2{263_090}          GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_18RS21}       GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_2603}         GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_CJB110}       GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_1169NT}       GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
msa523010.2{263_JM91130013}   GAAAGTCATA AGGCATTGTT TGAGTCTTAC GTATCAAAGA ATgTTCATAC
Consensus                     ********** ********** ********** ********** **_*******



                              351                                                400
msa523010.2{263_COH1}         TGTTATTCCA GAcTATATCC ATCCgAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_M732}         TGTTATTCCA GAcTATATCC ATCCgAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_M781}         TGTTATTCCA GAcTATATCC ATCCgAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_A909}         TGTTATTCCA GAtTATATCC ATCCgAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_H36B}         TGTTATTCCA GAtTATATCC ATCCgAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_090}          TGTTATTCCA GAcTATATCC ATCCaAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_18RS21}       TGTTATTCCA GAcTATATCC ATCCaAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_2603}         TGTTATTCCA GAcTATATCC ATCCaAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_CJB110}       TGTTATTCCA GAcTATATCC ATCCaAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_1169NT}       TGTTATTCCA GAcTATATCC ATCCaAGTGA TACGGCGACA CCTTATACTA
msa523010.2{263_JM91130013}   TGTTATTCCA GAcTATATCC ATCCgAGTGA TACGGCGACA CCTTATACTA
Consensus                     ********** **_******* ****_***** ********** **********



                              401                                                450
msa523010.2{263_COH1}         TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_M732}         TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_M781}         TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_A909}         TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_H36B}         TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGtTAA GGGACTTACC
msa523010.2{263_090}          TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_18RS21}       TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_2603}         TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_CJB110}       TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_1169NT}       TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
msa523010.2{263_JM91130013}   TAAATGGGAG TGTCTTGATT GTAAATAACG AATTAGcTAA GGGACTTACC
Consensus                     ********** ********** ********** ******_*** **********



                              451                                                500
msa523010.2{263_COH1}         ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_M732}         ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_M781}         ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_A909}         ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_H36B}         ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_090}          ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_18RS21}       ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_2603}         ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_CJB110}       ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_1169NT}       ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
msa523010.2{263_JM91130013}   ATCAAGAGTT ATGAAGATTT ATTACAGCCT TCCTTAAAAG GTAAAATTGC
Consensus                     ********** ********** ********** ********** **********



                              501                                                550
msa523010.2{263_COH1}         CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_M732}         CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_M781}         CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_A909}         CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_H36B}         CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_090}          CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_18RS21}       CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_2603}         CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_CJB110}       CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACtAATA
msa523010.2{263_1169NT}       CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACcAATA
msa523010.2{263_JM91130013}   CTTTGCAGAT CCGAATACTT CCTCTAGTGC TTTCTCACAA CTCACcAATA
Consensus                     ********** ********** ********** ********** *****_****



                              551                                                600
msa523010.2{263_COH1}         TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_M732}         TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_M781}         TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_A909}         TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_H36B}         TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_090}          TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_18RS21}       TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_2603}         TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_CJB110}       TACTCTTGGC cAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_1169NT}       TACTCTTGGC aAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
msa523010.2{263_JM91130013}   TACTCTTGGC aAAGGGTGGT TACACCAATC CAAAAGCGTG GAACTATGTT
Consensus                     ********** _********* ********** ********** **********

                              601                                                650
msa523010.2{263_COH1}         AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_M732}         AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_M781}         AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_A909}         AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_H36B}         AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_090}          AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_18RS21}       AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_2603}         AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_CJB110}       AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_1169NT}       AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
msa523010.2{263_JM91130013}   AAAAAGCTAC AACATAATAT TAATGCTATC AAATCTTCTA GCTCTTCAGA
Consensus                     ********** ********** ********** ********** **********

                              651                                                700
msa523010.2{263_COH1}         AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG
msa523010.2{263_M732}         AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG
msa523010.2{263_M781}         AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG
msa523010.2{263_A909}         AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG
msa523010.2{263_H36B}         AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG
msa523010.2{263_090}          AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG
msa523010.2{263_18RS21}       AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG
msa523010.2{263_2603}         AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG
msa523010.2{263_CJB110}       AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG cTGACTTACG
msa523010.2{263_1169NT}       AGTTTATCAA TCAGTTGCAG AAGGaAAAAT GATTGTGGGG tTGACTTACG
msa523010.2{263_JM91130013}   AGTTTATCAA TCAGTTGCAG AAGGcAAAAT GATTGTGGGG cTGACTTACG
Consensus                     ********** ********** ****_***** ********** _*********

                              701                                                750
msa523010.2{263_COH1}         AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_M732}         AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_M781}         AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_A909}         AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_H36B}         AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_090}          AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_18RS21}       AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_2603}         AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_CJB110}       AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_1169NT}       AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTa
msa523010.2{263_JM91130013}   AAGACCCTAG TGTCAATTTG CAAAAAAGTG GTGCCAATGT TTCTATTGTg
Consensus                     ********** ********** ********** ********** *********_

                              751                                                800
msa523010.2{263_COH1}         TAcCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_M732}         TAcCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_M781}         TAcCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_A909}         TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_H36B}         TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_090}          TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_18RS21}       TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_2603}         TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_CJB110}       TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_1169NT}       TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
msa523010.2{263_JM91130013}   TAtCCGACAG AAGGGACAGT TTTTGTCCCA TCTTCGGTTG CAATTATAAA
Consensus                     **_******* ********** ********** ********** **********

                              801                                                850
msa523010.2{263_COH1}         GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_M732}         GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_M781}         GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_A909}         GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_H36B}         GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_090}          GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_18RS21}       GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_2603}         GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_CJB110}       GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_1169NT}       GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
msa523010.2{263_JM91130013}   GAATGCTCCT TCTATGAAAG AAGCAAAGTT ATTTATTAAT TTTATGCTTT
Consensus                     ********** ********** ********** ********** **********

                              851                                                900
msa523010.2{263_COH1}         CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_M732}         CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_M781}         CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_A909}         CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_H36B}         CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_090}          CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_18RS21}       CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_2603}         CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_CJB110}       CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_1169NT}       CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
msa523010.2{263_JM91130013}   CTTTAGATGT TCAAAATGCC TTTGGGCAGT CAACGAGTAA CCGACCTATT
Consensus                     ********** ********** ********** ********** **********



                              901                                                950
msa523010.2{263_COH1}         CGTAAAGATG CCCAAACaAG TAATGGCATG AAAGCTTTAA AGGATATcGC
msa523010.2{263_M732}         CGTAAAGATG CCCAAACaAG TAATGGCATG AAAGCTTTAA AGGATATcGC
msa523010.2{263_M781}         CGTAAAGATG CCCAAACaAG TAATGGCATG AAAGCTTTAA AGGATATcGC
msa523010.2{263_A909}         CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_H36B}         CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_090}          CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_18RS21}       CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_2603}         CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_CJB110}       CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_1169NT}       CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
msa523010.2{263_JM91130013}   CGTAAAGATG CCCAAACgAG TAATGGCATG AAAGCTTTAA AGGATATtGC
Consensus                     ********** *******_** ********** ********** *******_**

                              951                                               1000
msa523010.2{263_COH1}         TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG aGCCAAATCC
msa523010.2{263_M732}         TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG aGCCAAATCC
msa523010.2{263_M781}         TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG aGCCAAATCC
msa523010.2{263_A909}         TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_H36B}         TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_090}          TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_18RS21}       TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_2603}         TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_CJB110}       TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_1169NT}       TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
msa523010.2{263_JM91130013}   TACTCTTAAA GAAGATTATC GCTATGTCAC TAAGCATAAG gGCCAAATCC
Consensus                     ********** ********** ********** ********** _*********



                              1001                              1035
msa523010.2{263_COH1}         TTAAAACCTA TAATCGcATT CGTAGAAATG CTGAT
msa523010.2{263_M732}         TTAAAACCTA TAATCGcATT CGTAGAAATG CTGAT
msa523010.2{263_M781}         TTAAAACCTA TAATCGcATT CGTAGAAATG CTGAT
msa523010.2{263_A909}         TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_H36B}         TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_090}          TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_18RS21}       TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_2603}         TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_CJB110}       TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_1169NT}       TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
msa523010.2{263_JM91130013}   TTAAAACCTA TAATCGtATT CGTAGAAATG CTGAT
Consensus                     ********** ******_*** ********** *****



SEQ ID NO. 6012
STRAIN 2603 frame: 1

MKEKQSKRLIYILLVVSIIFISVFTYSISQPSKLLPPKELVILSPNSQAILTGTIPAFEE
KYGIKVKLIQGGTGQLIDRLSKEGKQLKADIFFGGNYTQFESHKALFESYVSKNVHTVIP
DYIHPSDTATPYTINGSVLIVNNELAKGLTIKSYEDLLQPSLKGKIAFADPNTSSSAFSQ
LTNILLAKGGYTNPKAWNYVKKLQHNINAIKSSSSSEVYQSVAEGKMIVGLTYEDPSVNL
QKSGANVSIVYPTEGTVFVPSSVAIIKNAPSMKEAKLFINFMLSLDVQNAFGQSTSNRPI
RKDAQTSNGMKALKDIATLKEDYRYVTKHKGQILKTYNRIRRNAD

SEQ ID NO. 6013 
STRAIN 090 frame: 1 

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DIFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KGQILKTYNRIRRNAD

SEQ ID NO. 6014 
STRAIN A909 frame: 1 

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DIFFGGNYTQFESHKALFESYVSKNIHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KGQILKTYNRIRRNAD

SEQ ID NO. 6015 

STRAIN H36B frame: 2 

KLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKADIF
FGGNYTQFESHKALFESYVSKNIHTVIPDYIHPSDTATPYTINGSVLIVNNELVKGLTIK
SYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQ^
SSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNAPSM
KEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKHKG
ILKTYNRIRRNAD



SEQ ID NO. 6016 
STRAIN 18RS21 frame: 1

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DIFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KGQILKTYNRIRRNAD

SEQ ID NO. 6017 
STRAIN M732 frame: 1 

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA 
DIFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KSQILKTYNRIRRNAD

SEQ ID NO. 6018
STRAIN COH1 frame: 1

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DIFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KSQILKTYNRIRRNAD

SEQ ID NO. 6019 

STRAIN M781 frame: 1 

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DIFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KSQILKTYNRIRRNAD

SEQ ID NO. 6020 

STRAIN CJB110 frame: 1 

QPFKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DIFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KGQILKTYNRIRRNAD

SEQ ID NO. 6021 

STRAIN 1169NT frame: 3

SQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKHLKADIFFGGNYTQFESHKAL
FESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGLTIKSYEDLLQPSLKGKI
AFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINAIKSSSSSEVYQ
MIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNAPSMKEAKLFINFMLSLD
VQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKHKGQILKTYNRIRRNAD

SEQ ID NO. 6022 
STRAIN JM91 130013 frame: 1 

QPSKLLPPKELVILSPNSQAILTGTIPAFEEKYGIKVKLIQGGTGQLIDRLSKEGKQLKA
DVFFGGNYTQFESHKALFESYVSKNVHTVIPDYIHPSDTATPYTINGSVLIVNNELAKGL
TIKSYEDLLQPSLKGKIAFADPNTSSSAFSQLTNILLAKGGYTNPKAWNYVKKLQHNINA
IKSSSSSEVYQSVAEGKMIVGLTYEDPSVNLQKSGANVSIVYPTEGTVFVPSSVAIIKNA
PSMKEAKLFINFMLSLDVQNAFGQSTSNRPIRKDAQTSNGMKALKDIATLKEDYRYVTKH
KGQILKTYNRIRRNAD

PRETTY of: /biotmp/msa523117.2{*}  April 28, 2003 08:56  ..

                              1                                                   50
msa523117.2{263_COH1}                                        q pskllppkel vilspnSQAI
msa523117.2{263_M732}                                        q pskllppkel vilspnSQAI
msa523117.2{263_M781}                                        q pskllppkel vilspnSQAI
msa523117.2{263_1169NT}                                                         SQAI
msa523117.2{263_CJB110}                                      q pfkllppkel vilspnSQAI
msa523117.2{263_090}                                         q pskllppkel vilspnSQAI
msa523117.2{263_18RS21}                                      q pskllppkel vilspnSQAI
msa523117.2{263_2603}         mkekqskrli yillvvsiif isvftysisq pskllppkel vilspnSQAI
msa523117.2{263_A909}                                        q pskllppkel vilspnSQAI
msa523117.2{263_JM91130013}                                  q pskllppkel vilspnSQAI
msa523117.2{263_H36B}                                            kllppkel vilspnSQAI
Consensus                     ********** ********** *********- ****

                              51                                                 100
msa523117.2{263_COH1}         LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_M732}         LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_M781}         LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_1169NT}       LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKhLKAD iFFGGNYTQF
msa523117.2{263_CJB110}       LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_090}          LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_18RS21}       LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_2603}         LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_A909}         LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
msa523117.2{263_JM91130013}   LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD vFFGGNYTQF
msa523117.2{263_H36B}         LTGTIPAFEE KYGIKVKLIQ GGTGQLIDRL SKEGKqLKAD iFFGGNYTQF
Consensus                     ********** ********** ********** *****_**** _*********

msa523117.2{263_COH1}
msa523117.2{263_M732}
msa523117.2{263_M781}
msa523117.2{263_1169NT}
msa523117.2{263_CJB110}
msa523117.2{263_090}
msa523117.2{263_18RS21}
msa523117.2{263_2603}
msa523117.2{263_A909}
msa523117.2{263_JM91130013}
msa523117.2{263_H36B}
Consensus

msa523117.2{263_COH1}
msa523117.2{263_M732}
msa523117.2{263_M781}
msa523117.2{263_1169NT}
msa523117.2{263_CJB110}
msa523117.2{263_090}
msa523117.2{263_18RS21}
msa523117.2{263_2603}
msa523117.2{263_A909}
msa523117.2{263_JM91130013}
msa523117.2{263_H36B}
Consensus

msa523117.2{263_COH1}
msa523117.2{263_M732}
msa523117.2{263_M781}
msa523117.2{263_1169NT}
msa523117.2{263_CJB110}
msa523117.2{263_090}
msa523117.2{263_18RS21}
msa523117.2{263_2603}
msa523117.2{263_A909}
msa523117.2{263_JM91130013}
msa523117.2{263_H36B}
Consensus

msa523117.2{263_COH1}
msa523117.2{263_M732}
msa523117.2{263_M781}
msa523117.2{263_1169NT}
msa523117.2{263_CJB110}
msa523117.2{263_090}
msa523117.2{263_18RS21}
msa523117.2{263_2603}
msa523117.2{263_A909}
msa523117.2{263_JM91130013}
msa523117.2{263_H36B}
Consensus

msa523117 .
msa523117 .
msa523117 .
msa523117.2(
msa523117.2{
msa523117
msa523117.2{
msa523117 .
msa523117 .
msa523117.2{263_.
msa523117 ,



101 

ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 
ESHKALFESY 



VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNvHTVIP 
VSKNiHTVIP 
VSKNVHTVIP 
VSKNiHTVIP 



********** ****_***** 



DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 
DYIHPSDTAT 

DYIHPSDTAT 
********** 



PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 

PYTINGSVLI 
********** 



2{263_COHl) 
2{263_M732) 
2{263_M78l} 
263_1169NT) 
263_CJB110} 
2{263_090} 
263_18RS2l} 
2{263_2603j 
2{263_A909) 
JM91130013} 
'2{263_H36B} 
Consensus 



151 

IKSYEDLLQP 
I KSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 
IKSYEDLLQP 



SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 
SLKGKIAFAD 



PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 
PNTSSSAFSQ 



********** ********** ********** 



201 

KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 
KKLQHNINAI 



KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 
KSSSSSEVYQ 



********** ********** 



SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 
SVAEGKMIVG 

SVAEGKMIVG 
********** 



150 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELaKGLT 

VNNELvKGLT 
*****_**** 

200 

YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 
YTNPKAWNYV 

YTNPKAWNYV 
********** 

250 

QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
QKSGANVSIV 
********** ********** 



LTN I LLAKGG 
LTNILLAKGG 
LTN I LLAKGG 
LTNILLAKGG 
LTN I LLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 
LTNILLAKGG 

LTNILLAKGG 
********** 



LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 
LTYEDPSVNL 



251 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 

YPTEGTVFVP 
********** 

301 

RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 
RKDAQTSNGM 



SSVAIIKNAP 

SSVAIIKNAP 

SSVAIIKNAP 

SSVAI I KNAP 

SSVAIIKNAP 

SSVAIIKNAP 

SSVAIIKNAP 

SSVAIIKNAP 

SSVAI I KNAP 

SSVAIIKNAP 

SSVAIIKNAP 
********** 



SMKEAKLF I N 
SMKEAKLF IN 
SMKEAKLF I N 
SMKEAKLF IN 
SMKEAKLFIN 
SMKEAKLF IN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 
SMKEAKLFIN 



FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 
FMLSLDVQNA 



300 

FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 
FGQSTSNRPI 



********** ********** ********** 



KALKDIATLK 
KALKD I ATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 
KALKDIATLK 



********** ********** 



EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 

EDYRYVTKHK 
********** 



SQILKTYNRI 

SQILKTYNRI 

SQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 

gQILKTYNRI 
_********* 



345 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 

RRNAD 
***** 
Table 61: Comparative Sequences relating to SAG1030



SEQ ID NO. 6101 
STRAIN 2603 

ATGGTAAAAGTTAGTGTAAGTTCTGTAGGAACTCAAGCATCAACAGTAGCTATTTCTATG
TTTAGTCGTGTATCGGCTTTAAATGATGCAATAACAAAACTATCATCTTTTGCAGAGGCT
GCAACTCTTCAAGGGACTGCTTATTCAAATGCAAAAAGCTATGCTACTGGAACGTTAACT
CCGATGCTTCAAGGAATGATTCTTTTCTCTGAAACATTGAGTGAGAAATGTACAGAATTA
CAAACCTTATATGTCTCAATTTGTGGTGATGAGGATTTAGACTCTGTCGTTTTAGAATCA
AAATTAGCAAGTGATAGGGCATCATTAAAGATTGCTGAAGCACTTTTAGAGCATCTTAAC
GATGATCCAGAACCTTCCAAATCTGCCATAAGTTCTACAAAAAGTAATATTAAAAAATTA
AAAAAACGTATAAAATCTAATCAAAAGAAATTAGACAACCTTAATGAATTTAACGCCCAT
TCAGCAACAGTATTTGCGGACATTTCTAATGCACAGTCAACTGTTAACCAAGCACTAGCG
GCTGTTTCAACAGGATTTTCTGGATATAATAGTAAAACCGGAGCTTTTGGAAAACCAACA
TCCGGACAGATGGAATGGACAAAGACAGTTAAGAAGAATTGGAAAGAGCGAGAAGACGCC
AAAGCTGAAGAACTGAAAAGTAAAAAGGCTGAAGAAAGTAAGAAAGCTTCAAAAATTGAA
AATACTACTAAAAAAAGTAATGTTTCAGTTGATAAAAAGAAATTAATAAAAGCGGCTAAT
GAAGCGTATAAATTAGGAGAAATTAAAAAAGATACCTATGAATCAATTATCAGTGGTTTA
AGTAATGCATCGGCTGCCTTACTTAAAGAGGTAGCTAAATCAAAATTGACTGACACAGCT
CGGCTATTGATG

SEQ ID NO. 6102
STRAIN 090 

TTAAATGATGCAATAACAAAACTATCATCTTTTGCAGAGGCT
GCAACTCTTCAAGGGACTGCTTATTCAAATGCAAAAAGCTATGCTACTGG
AACGTTAACTCCGATGCTTCAAGGAATGATTCTTTTCTCTGAAACATTGA
GTGAGAAATGTACAGAATTACAAACCTTATATGTCTCAATTTGTGGTGAT
GAGGATTTAGACTCTGTCGTTTTAGAATCAAAATTAGCAAGTGATAGGGC
ATCATTAAAGATTGCTGAAGCACTTTTAGAGCATCTTAACGATGATCCAG
AACCTTCCAAATCTGCCATAAGTTCTACAAAAAGTAATATTAAAAAATTA
AAAAAACGTATAAAATCTAATCAAAAGAAATTAGACAACCTTAATGAATT
TAACGCCCATTCAGCAACAGTATTTGCGGACATTTCTAATGCACAGTCAA
CTGTTAACCAAGCACTAGCGGCTGTTTCAACAGGATTTTCTGGATATAAT
AGTAAAACCGGAGCTTTTGGAAAACCAACATCCGGACAGATGGAATGGAC
AAAGACAGTTAAGAAGAATTGGAAAGAGCGAGAAGACGCCAAAGCTGAAG
AACTGAAAAGTAAAAAGGCTGAAGAAAGTAAGAAAGCTTCAAAAATTGAA
AATACTACTAAAAAAAGTAATGTTTCAGTTGATAAAAAGAAATTAATAAA
AGCGGCTAATGAAGCGTATAAATTAGGAGAAATTAAAAAAGATACCTATG
AATCAATTATCAGTGGTTTAAGTAATGCATCGGCTGCCTTACTTAAAGAG
GTAGCTAAATCAAAATTGACTGACACAGCTCGGCTATTGATG

SEQ ID NO. 6103
STRAIN 18RS21

TTAAATGATGCAATAACAAAACTATCATCTTTTGCAGAGGC
TGCAACTCTTCAAGGGACTGCTTATTCAAATGCAAAAAGCTATGCTACTG
GAACGTTAACTCCGATGCTTCAAGGAATGATTCTTTTCTCTGAAACATTG
AGTGAGAAATGTACAGAATTACAAACCTTATATGTCTCAATTTGTGGTGA
TGAGGATTTAGACTCTGTCGTTTTAGAATCAAAATTAGCAAGTGATAGGG
CATCATTAAAGATTGCTGAAGCACTTTTAGAGCATCTTAACGATGATCCA
GAACCTTCCAAATCTGCCATAAGTTCTACAAAAAGTAATATTAAAAAATT
AAAAAAACGTATAAAATCTAATCAAAAGAAATTAGACAACCTTAATGAAT
TTAACGCCCATTCAGCAACAGTATTTGCGGACATTTCTAATGCACAGTCA
ACTGTTAACCAAGCACTAGCGGCTGTTTCAACAGGATTTTCTGGATATAA
TAGTAAAACCGGAGCTTTTGGAAAACCAACATCCGGACAGATGGAATGGA
CAAAGACAGTTAAGAAGAATTGGAAAGAGCGAGAAGACGCCAAAGCTGAA
GAACTGAAAAGTAAAAAGGCTGAAGAAAGTAAGAAAGCTTCAAAAATTGA
AAATACTACTAAAAAAAGTAATGTTTCAGTTGATAAAAAGAAATTAATAA
AAGCGGCTAATGAAGCGTATAAATTAGGAGAAATTAAAAAAGATACCTAT
GAATCAATTATCAGTGGTTTAAGTAATGCATCGGCTGCCTTACTTAAAGA
GGTAGCTAAATCAAAATTGACTGACACAGCTCGGCTATTGATG

PRETTY of: /biotmp/msa185066.2{*}  May 13, 2003 07:01

                         1                                                   50
msa185066.2{270_090}
msa185066.2{270_18RS21}
msa185066.2{270_2603}    atggtaaaag ttagtgtaag ttctgtagga actcaagcat caacagtagc
Consensus                ********** ********** ********** ********** **********

                         51                                                 100
msa185066.2{270_090}                                   TT AAATGATGCA ATAACAAAAC
msa185066.2{270_18RS21}                                TT AAATGATGCA ATAACAAAAC
msa185066.2{270_2603}    tatttctatg tttagtcgtg tatcggctTT AAATGATGCA ATAACAAAAC
Consensus                ********** ********** ********** ********** **********

                         101                                                150
msa185066.2{270_090}     TATCATCTTT TGCAGAGGCT GCAACTCTTC AAGGGACTGC TTATTCAAAT
msa185066.2{270_18RS21}  TATCATCTTT TGCAGAGGCT GCAACTCTTC AAGGGACTGC TTATTCAAAT
msa185066.2{270_2603}    TATCATCTTT TGCAGAGGCT GCAACTCTTC AAGGGACTGC TTATTCAAAT
Consensus                ********** ********** ********** ********** **********

                         151                                                200
msa185066.2{270_090}     GCAAAAAGCT ATGCTACTGG AACGTTAACT CCGATGCTTC AAGGAATGAT
msa185066.2{270_18RS21}  GCAAAAAGCT ATGCTACTGG AACGTTAACT CCGATGCTTC AAGGAATGAT
msa185066.2{270_2603}    GCAAAAAGCT ATGCTACTGG AACGTTAACT CCGATGCTTC AAGGAATGAT
Consensus                ********** ********** ********** ********** **********

                         201                                                250
msa185066.2{270_090}     TCTTTTCTCT GAAACATTGA GTGAGAAATG TACAGAATTA CAAACCTTAT
msa185066.2{270_18RS21}  TCTTTTCTCT GAAACATTGA GTGAGAAATG TACAGAATTA CAAACCTTAT
msa185066.2{270_2603}    TCTTTTCTCT GAAACATTGA GTGAGAAATG TACAGAATTA CAAACCTTAT
Consensus                ********** ********** ********** ********** **********

                         251                                                300
msa185066.2{270_090}     ATGTCTCAAT TTGTGGTGAT GAGGATTTAG ACTCTGTCGT TTTAGAATCA
msa185066.2{270_18RS21}  ATGTCTCAAT TTGTGGTGAT GAGGATTTAG ACTCTGTCGT TTTAGAATCA
msa185066.2{270_2603}    ATGTCTCAAT TTGTGGTGAT GAGGATTTAG ACTCTGTCGT TTTAGAATCA
Consensus                ********** ********** ********** ********** **********

                         301                                                350
msa185066.2{270_090}     AAATTAGCAA GTGATAGGGC ATCATTAAAG ATTGCTGAAG CACTTTTAGA
msa185066.2{270_18RS21}  AAATTAGCAA GTGATAGGGC ATCATTAAAG ATTGCTGAAG CACTTTTAGA
msa185066.2{270_2603}    AAATTAGCAA GTGATAGGGC ATCATTAAAG ATTGCTGAAG CACTTTTAGA
Consensus                ********** ********** ********** ********** **********

                         351                                                400
msa185066.2{270_090}     GCATCTTAAC GATGATCCAG AACCTTCCAA ATCTGCCATA AGTTCTACAA
msa185066.2{270_18RS21}  GCATCTTAAC GATGATCCAG AACCTTCCAA ATCTGCCATA AGTTCTACAA
msa185066.2{270_2603}    GCATCTTAAC GATGATCCAG AACCTTCCAA ATCTGCCATA AGTTCTACAA
Consensus                ********** ********** ********** ********** **********

                         401                                                450
msa185066.2{270_090}     AAAGTAATAT TAAAAAATTA AAAAAACGTA TAAAATCTAA TCAAAAGAAA
msa185066.2{270_18RS21}  AAAGTAATAT TAAAAAATTA AAAAAACGTA TAAAATCTAA TCAAAAGAAA
msa185066.2{270_2603}    AAAGTAATAT TAAAAAATTA AAAAAACGTA TAAAATCTAA TCAAAAGAAA
Consensus                ********** ********** ********** ********** **********

                         451                                                500
msa185066.2{270_090}     TTAGACAACC TTAATGAATT TAACGCCCAT TCAGCAACAG TATTTGCGGA
msa185066.2{270_18RS21}  TTAGACAACC TTAATGAATT TAACGCCCAT TCAGCAACAG TATTTGCGGA
msa185066.2{270_2603}    TTAGACAACC TTAATGAATT TAACGCCCAT TCAGCAACAG TATTTGCGGA
Consensus                ********** ********** ********** ********** **********

                         501                                                550
msa185066.2{270_090}     CATTTCTAAT GCACAGTCAA CTGTTAACCA AGCACTAGCG GCTGTTTCAA
msa185066.2{270_18RS21}  CATTTCTAAT GCACAGTCAA CTGTTAACCA AGCACTAGCG GCTGTTTCAA
msa185066.2{270_2603}    CATTTCTAAT GCACAGTCAA CTGTTAACCA AGCACTAGCG GCTGTTTCAA
Consensus                ********** ********** ********** ********** **********

                         551                                                600
msa185066.2{270_090}     CAGGATTTTC TGGATATAAT AGTAAAACCG GAGCTTTTGG AAAACCAACA
msa185066.2{270_18RS21}  CAGGATTTTC TGGATATAAT AGTAAAACCG GAGCTTTTGG AAAACCAACA
msa185066.2{270_2603}    CAGGATTTTC TGGATATAAT AGTAAAACCG GAGCTTTTGG AAAACCAACA
Consensus                ********** ********** ********** ********** **********

                         601                                                650
msa185066.2{270_090}     TCCGGACAGA TGGAATGGAC AAAGACAGTT AAGAAGAATT GGAAAGAGCG
msa185066.2{270_18RS21}  TCCGGACAGA TGGAATGGAC AAAGACAGTT AAGAAGAATT GGAAAGAGCG
msa185066.2{270_2603}    TCCGGACAGA TGGAATGGAC AAAGACAGTT AAGAAGAATT GGAAAGAGCG
Consensus                ********** ********** ********** ********** **********

                         651                                                700
msa185066.2{270_090}     AGAAGACGCC AAAGCTGAAG AACTGAAAAG TAAAAAGGCT GAAGAAAGTA
msa185066.2{270_18RS21}  AGAAGACGCC AAAGCTGAAG AACTGAAAAG TAAAAAGGCT GAAGAAAGTA
msa185066.2{270_2603}    AGAAGACGCC AAAGCTGAAG AACTGAAAAG TAAAAAGGCT GAAGAAAGTA
Consensus                ********** ********** ********** ********** **********

                         701                                                750
msa185066.2{270_090}     AGAAAGCTTC AAAAATTGAA AATACTACTA AAAAAAGTAA TGTTTCAGTT
msa185066.2{270_18RS21}  AGAAAGCTTC AAAAATTGAA AATACTACTA AAAAAAGTAA TGTTTCAGTT
msa185066.2{270_2603}    AGAAAGCTTC AAAAATTGAA AATACTACTA AAAAAAGTAA TGTTTCAGTT
Consensus                ********** ********** ********** ********** **********

                         751                                                800
msa185066.2{270_090}     GATAAAAAGA AATTAATAAA AGCGGCTAAT GAAGCGTATA AATTAGGAGA
msa185066.2{270_18RS21}  GATAAAAAGA AATTAATAAA AGCGGCTAAT GAAGCGTATA AATTAGGAGA
msa185066.2{270_2603}    GATAAAAAGA AATTAATAAA AGCGGCTAAT GAAGCGTATA AATTAGGAGA
Consensus                ********** ********** ********** ********** **********

                         801                                                850
msa185066.2{270_090}     AATTAAAAAA GATACCTATG AATCAATTAT CAGTGGTTTA AGTAATGCAT
msa185066.2{270_18RS21}  AATTAAAAAA GATACCTATG AATCAATTAT CAGTGGTTTA AGTAATGCAT
msa185066.2{270_2603}    AATTAAAAAA GATACCTATG AATCAATTAT CAGTGGTTTA AGTAATGCAT
Consensus                ********** ********** ********** ********** **********

                         851                                                900
msa185066.2{270_090}     CGGCTGCCTT ACTTAAAGAG GTAGCTAAAT CAAAATTGAC TGACACAGCT
msa185066.2{270_18RS21}  CGGCTGCCTT ACTTAAAGAG GTAGCTAAAT CAAAATTGAC TGACACAGCT
msa185066.2{270_2603}    CGGCTGCCTT ACTTAAAGAG GTAGCTAAAT CAAAATTGAC TGACACAGCT
Consensus                ********** ********** ********** ********** **********

                         901       912
msa185066.2{270_090}     CGGCTATTGA TG
msa185066.2{270_18RS21}  CGGCTATTGA TG
msa185066.2{270_2603}    CGGCTATTGA TG
Consensus                ************

SEQ ID NO. 6104
STRAIN 2603 frame: 1

MVKVSVSSVGTQASTVAISMFSRVSALNDAITKLSSFAEAATLQGTAYSNAKSYATGTLT
PMLQGMILFSETLSEKCTELQTLYVSICGDEDLDSVVLESKLASDRASLKIAEALLEHLN
DDPEPSKSAISSTKSNIKKLKKRIKSNQKKLDNLNEFNAHSATVFADISNAQSTVNQALA
AVSTGFSGYNSKTGAFGKPTSGQMEWTKTVKKNWKEREDAKAEELKSKKAEESKKASKIE
NTTKKSNVSVDKKKLIKAANEAYKLGEIKKDTYESIISGLSNASAALLKEVAKSKLTDTA
RLLM

SEQ ID NO. 6105 
STRAIN 090 frame: 1 

LNDAITKLSSFAEAATLQGTAYSNAKSYATGTLTPMLQGMILFSETLSEKCTELQTLYVS
ICGDEDLDSVVLESKLASDRASLKIAEALLEHLNDDPEPSKSAISSTKSNIKKLKKRIKS
NQKKLDNLNEFNAHSATVFADISNAQSTVNQALAAVSTGFSGYNSKTGAFGKPTSGQMEW
TKTVKKNWKEREDAKAEELKSKKAEESKKASKIENTTKKSNVSVDKKKLIKAANEAYKLG
EIKKDTYESIISGLSNASAALLKEVAKSKLTDTARLLM

SEQ ID NO. 6106
STRAIN 18RS21 frame: 1

LNDAITKLSSFAEAATLQGTAYSNAKSYATGTLTPMLQGMILFSETLSEKCTELQTLYVS
ICGDEDLDSVVLESKLASDRASLKIAEALLEHLNDDPEPSKSAISSTKSNIKKLKKRIKS
NQKKLDNLNEFNAHSATVFADISNAQSTVNQALAAVSTGFSGYNSKTGAFGKPTSGQMEW
TKTVKKNWKEREDAKAEELKSKKAEESKKASKIENTTKKSNVSVDKKKLIKAANEAYKLG
EIKKDTYESIISGLSNASAALLKEVAKSKLTDTARLLM

PRETTY of: /biotmp/msa185181.2{*}  May 13, 2003 07:03

                         1                                                   50
msa185181.2{270_090}                                 LNDA ITKLSSFAEA ATLQGTAYSN
msa185181.2{270_18RS21}                              LNDA ITKLSSFAEA ATLQGTAYSN
msa185181.2{270_2603}    mvkvsvssvg tqastvaism fsrvsaLNDA ITKLSSFAEA ATLQGTAYSN
Consensus                ********** ********** ********** ********** **********

                         51                                                 100
msa185181.2{270_090}     AKSYATGTLT PMLQGMILFS ETLSEKCTEL QTLYVSICGD EDLDSVVLES
msa185181.2{270_18RS21}  AKSYATGTLT PMLQGMILFS ETLSEKCTEL QTLYVSICGD EDLDSVVLES
msa185181.2{270_2603}    AKSYATGTLT PMLQGMILFS ETLSEKCTEL QTLYVSICGD EDLDSVVLES
Consensus                ********** ********** ********** ********** **********

                         101                                                150
msa185181.2{270_090}     KLASDRASLK IAEALLEHLN DDPEPSKSAI SSTKSNIKKL KKRIKSNQKK
msa185181.2{270_18RS21}  KLASDRASLK IAEALLEHLN DDPEPSKSAI SSTKSNIKKL KKRIKSNQKK
msa185181.2{270_2603}    KLASDRASLK IAEALLEHLN DDPEPSKSAI SSTKSNIKKL KKRIKSNQKK
Consensus                ********** ********** ********** ********** **********

                         151                                                200
msa185181.2{270_090}     LDNLNEFNAH SATVFADISN AQSTVNQALA AVSTGFSGYN SKTGAFGKPT
msa185181.2{270_18RS21}  LDNLNEFNAH SATVFADISN AQSTVNQALA AVSTGFSGYN SKTGAFGKPT
msa185181.2{270_2603}    LDNLNEFNAH SATVFADISN AQSTVNQALA AVSTGFSGYN SKTGAFGKPT
Consensus                ********** ********** ********** ********** **********

                         201                                                250
msa185181.2{270_090}     SGQMEWTKTV KKNWKEREDA KAEELKSKKA EESKKASKIE NTTKKSNVSV
msa185181.2{270_18RS21}  SGQMEWTKTV KKNWKEREDA KAEELKSKKA EESKKASKIE NTTKKSNVSV
msa185181.2{270_2603}    SGQMEWTKTV KKNWKEREDA KAEELKSKKA EESKKASKIE NTTKKSNVSV
Consensus                ********** ********** ********** ********** **********

                         251                                                300
msa185181.2{270_090}     DKKKLIKAAN EAYKLGEIKK DTYESIISGL SNASAALLKE VAKSKLTDTA
msa185181.2{270_18RS21}  DKKKLIKAAN EAYKLGEIKK DTYESIISGL SNASAALLKE VAKSKLTDTA
msa185181.2{270_2603}    DKKKLIKAAN EAYKLGEIKK DTYESIISGL SNASAALLKE VAKSKLTDTA
Consensus                ********** ********** ********** ********** **********

                         301
msa185181.2{270_090}     RLLM
msa185181.2{270_18RS21}  RLLM
msa185181.2{270_2603}    RLLM
Consensus                ****
SEQ ID NO. 6201
STRAIN 2603 

ATGATTTTAAAAATTTGTCGTGCAGCATATAGTTTACAATGGG^ 
GCTTTGCTGGATTATCCTCGAATTAAGGCGTTTGAATTC 

GCI^ACGAGAAACAATATAAAAGAAAAACTGAGATACAATGTGACGATAAAG^TCTCCT 
GCAAAAATTGTTCATTTTTTAAAATACAATAGTTTTAC^ 

AGAGAAGCGGCAGCTACITTTAATGAGGATGGTATTAGTTTAACTTCTGATrTl'TTAAGC 
CATACAT GT ACG ATTGAAACTG CAAAACT AATTTTTAAAG AAGGT AAAAT CTTAT CAG CA 
GTTAAAGCCTTTAATAAGCCTGCTGAAGTACTGGTAAAAGATAAGAGGAATGCTGCTGGA 
GACC CTAAAG ATTACTTTG ACTATGTGAT GTTGAACTGGT CAAAT AC CAATT CTGGTTAT 
CGTTTAGTAATGGAAAGATTGTTAGG CAAAGCAC CAT CTGAACAGGAGTTAACAGTAGGT 
TTTAAGC CAGGGGT CAGTTTTCATTTTACTTAT CAAGAT AT CAT CAATCATC CTGATTCT 
ATTTTTGATGGTTATCATCCTGCTAAAATTAAAAATCA.GCTTTC 

GTTG CAT GTGTT AT C C CAAAAC AT TAT CAAGAAG ATT AT CAAAG C CTTGTG C C CAATGAC 

TTGAAACACAGGGTTTATTATTTAGATTACTGTAACGAAA 

AAAGTTT ATGATTT T CTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6202
STRAIN 090 

TGGATT?ATCCTCTAATTAAGGCGTTTGAATTGGAAAGGATAGGAGCTT^ 

ATAGCnTACGAGAAAC^TATAAAAGAAAAATTGAGATAa\ATGTGACGA 

TAAACAT CT CCT CACAAAAATTGTT CATTTTTTAAAATACA^ 

CTTTTCC CTATATT C CCAAATATAGAGAAG CGGCAGCTACTTTTAATGAG 

GATGGTATTAGTTT AACTT CTG ATTTTTT AAG C CAT ACATGT ACGATTGA 

AACTGCAAAACTAATTTTTAAAGAAGGTAAAATCTTATC^ 

CCTTTAATAAGCCTGCTGAAGTACTGGTAAATGATAAGAGGAATGCTGCT 

GGAGACCCTAAAGATTACTTTGAiCTATGTGtATGTTGAACT 

CAATT CTGGTTATCGTTTAGTAATGGAAAGATTGTTAGG CAAAG CACCAT 

CTGAAC^GGAGTTAACAGTAGCITTTAAGC 

AATT aTCAAG ATAT CAT CAATCATC CTGATT CTA'ITTTTGATGGTTATCA 
T CCTG CTAAAATTAAAAAT CL^ACTTTCTTTAG CAGAACATTT AGTTG CAT 
GTGTT ATCC CAAAACATTATCAAGAAGATTAT CAAAG CCTTGTG CCTAAT 
GACTTGAAACACAGAGTTTATTATTTAGATTACTGTAACGAAACACTTTA 
TGAGTGGAATCAAAAAGTTTATGATTTTCTTTGTCATTTGGAA 

SEQ ID NO. 6203 
STRAIN A909 

TTGCTGGATTAT CCT CG AATTAAGG CGTTTGAATTGG AAAGGAT A 
GGAG CTTTCATAGCTTACGAGAAACAATATAAAAGAAAAATTGAGATACA 
ATGTGACGATAAACATCT CCT CACAAAAATTGTT CATTTTTT AAAATACA 
ATAGTl"ITACUU"i"rCCCTATATTCCCAAATATAGAGAAGCGGCAGCTACT 
TTTAATGAGGATGGTATTAGTTTAACTTCTGATTTTT^ 
TACGATTGAAACTG CAAAACTAATTTTTAAAGAAGGTAAAAT CTTAT CAG 
CAGTTAAAGCCTTTAATA^GCCTGCTGAAGTACTGGTAAATGATAAGAGG 
AATG CTGCTGGAGAC CCTAAAG ATTACTTTGACT ATGTGATGTTGAACTG 
GTCAAATAC CAATT CTGGTTAT CGTTTAGTAA.TGGAAAGATTGTTAGG CA 
AAGCAC CAT CTG AACAGGAGTTAACAGTAG CTTTTAAG C CAGGGGTCAG C 
TTTCATTTT AATTAT CAAGATATCATCAATCATCCTGATT CTATTTTTGA 
TGGTTAT CATCCTG CTAAAATT AAAAATCAACTTT CTTTAG CAGAACATT 
TAGTTGCATGTGTTATCCCAAAACATTATCAAGAAGATTATC^ 
GTGCCTAATGACTTGAAACACAGAGTTTATTATTTAGATTACT 
AACACTTTATGAGTGGAAT CAAAAAGTTTATG ATTTT CTTTGTCATTTGG 
AAAATAAA 

SEQ ID NO. 6204 
STRAIN H36B 

TTAAGGCGTTTG AATTGGAAAGGATAGGAG CTTT CAT AG CTTACG AGAAA 
CAATATAAAAGAAAAATTGAGATACAATGTGACGATAAACAT CT C CT CAC 
AAAAATTGTT CATTTTTT AAAATACAATAGTT^ 

C CAAATATAGAG AAG CGGCAG CTACTTTTAATGAGGATGGTATTAGTTTA 
ACTT CTG ATTTTTT AAG C CATACATGTACG ATTGAAACTG CAAAACT AAT 
TTTTAAAGAAGGTAAAAT CTTATCAG CAGTTAAAG CCTTTAATAAG CCTG 
CTGAAGT ACTGGTAAATGATAAGAGGAATG CTG CT GGAGACC CTAAAGAT 
TACTTTG ACTATGTGATGTTGAACTGGT CAAATAC CAATT CTGGTTATCG 
TTTAGTAATGGAAAGATTGTTAGGCAAAGCACCATCTGAAC^ 
CAGTAGCTTTTAAG CCAGG GGTCAG CTTT CATTTT AATTAT CAAGAT AT C 
ATCAATCATC CTGATT CTATTTTTGATGGTTATCATCCTGCTAAAATTAA 
AAAT CAACTTT CTTT AG CAGAACATTTAGTTG CATGTGTTATCCCAAAAC 
ATTATCAAGAAGATTATCAAAGCCTTGTGCCTAATGACTTGAAACACAGA 
GTTTATTATTTAGATTACTGTAACGAAACACT 
AGTTTATGATTTT CTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6205 
STRAIN 18RS21 

TTGCTGGATTATCCTCGAATTAAGGCGTT 

TGAATTGGAAAGGATAGGAG CTTT CATAG CTTACGAG AAACAAT AT AAAA 
GAAAAACTC4AGATACAATGTGACGATAAACAT CT C CT CGCAAAAATTGTT 
CATTTTTTAAAATACAATAGTTTT ACTTTT C C CTATATT C CCAAATATAG 
AGAAGQ^CAGCTACTTTTAATGAGGATGGTATTA 
TTTTAAGCCATACATGTACGATTGAAACTGCAAAACTAA 
GGTAAAATCTTAT CAG CAGTTAAAG CCITTAATAAGC CTG CTGAAGT ACT 
GGTAAAAGATAAGAGGAATGCTGCrGGAGACCCTAAAGATTACTTO 
ATGTGATGTTGAACTGGTCAAATACCAATTCTGGTTATCGTTTAGTAATG 
GAAAGATTGTT AGG CAAAG CAC CAT CTGAAGAGGAGTTAACAGTAGGTTT
TAAG C CAGGGGT CAGTTTT CATTTTACTT ATCAAG AT AT CAT CAAT CAT C 
CTGATTCTATTTTTG ATGGTTATCATC CTG CTAAA CAG CTT 

TCTTTAG CAG AACATTT AGTTG CATGTGTTAT CC CAAAACATTAT CAAG A 
AGATTAT CAAAG CCTTGTG CCCAATCACTTGAAACACAGGGTTTATT^ 
TAGATTACTGTAACGAAACACTTTATGAGTGGAATCAAAAAGT^ 
TTTCTTTGTCATTTGGAAAATAAA 

SEQ ID NO. 6206 
STRAIN M732 

TTGCTGGATTATCCTCGAATTAAGGCGTT 

TGAATTGGAAAGGATAGGAG CTTT CATAG CTTACG AG AAACAAT ATAAAA 
GAAAAACTGAGATACAATGTGACGATAAACATCTCCTCGCAAAAATTGTT 
CATTTTTTAAAAT ACAAT AGTTTT ACTTTT C C CTATATT C CCAAATATAG 
AG AAG CGGCAGCTACITTTAATGAGGATGGTATTAGTTT^ 
TTTT AAG CCATACATGT ACGATTGAAACTG CAAAACT AATTTTT 
GGTAAAAT CTTATCAGCAGTTAAAG CCTTTAATAAGCCTG CTGAAGT ACT 
GGTAAAAGATAAGAGGAATGCTGCTGGAGACCCTAAAGATTACrTTGACT 
ATGTGATGTTGAACTGGTCAAATAC CAATT CTGGTTAT CGTTTAGTAATG 
GAAAGATTGTTAGG CAAAG CAC CAT CTGAACAGGAGTTAACAGTAGGTTT 
TAAG C CAGGGGT CAGTTTTCATTTT ACTT ATCAAGATATCAT CAATCATC 
CTGATTCTATTTTTG ATGGTTATCAT CCTG CT AAAATTAA CTT 
TCTTTAG CAGAACATTT AGTTGCATGTGTTATC C CAAAACATTATCAAGA 
AGATTATCAAAGCCTTGTGCCCAATGACTTGAAACA 

TAGATTACTGTAACGAAACACTTTATGAGTGGAATCAAAAAGTTTATGAT 
TTTCTTTGnCATTTGGAAAATAAA 

SEQ ID NO. 6207
STRAIN COH1 

TTGCTGGAT 

TATCCTCGAATTAAGGCGTTTGAATTGGAAAGGATAGGAGCTTTCATAGC 

TTACGAGAAACAATATAAAAGAAAA^CTGAGATACAATGTGACGATAAAC 

ATCTC CTCGCAAAAATTGTT CATTTTTTAAAATACAATAGTTTT ACTTTT 

CCCTATATT C CCAAAT ATAGAG AAGCGGCAG CTACTTTTAATGAGGATGG 

TATTAGTTTAACTT CTGATTTTTTAAG CCATACATGT ACGATTGAAACTG 

CAAAACTAATTTTTAAAGAAGGTAAAATCTTATC^ 

AATAAGCCTGCTGAAGTACTGGTAAA^GATAAGAGGAATGCTGCTGGAGA 
CC CTAAAGATTACTTTG ACT ATGTGATGTTGAACTGGT CAAATAC CAATT 
CTGGTTATCGTTTAGTAATGGAAAGATTGTTAGG CAAAG CACCATCTGAA 
CAGGAGTTAACAGTAGGTTTTAAG C CAGGGGTCAGTTTT CATTTTACTTA 
T CAAGATAT CATCAAT CAT C CTGATT CTATTTTTGATGGTTATCATC CTG 
CTAAAATTAAAAATCAG CTTTCTTT AG CAGAACATTT AGTTG CATGTGTT 
ATCCCAAAACATTATCAAGAAG ATTATCAAAG C CTTGTG C CCAATGACTT 

GAAACACAGGX5TTTATTATTTAGATTACT 
GGAATCAAAAAGTTTATGATTTTCTTTGGCATTTGGAAA^ 

SEQ ID NO. 6208 
STRAIN M781 
TTGCTGGA 

TTAT C CT CG AATTAAGG CGTTT GAATTGGAAAGGAT AGG AG CTTT CATAG 

CTTACGAGAAACAATATAAAAGAAAAACTGAGATACAATGTGACGATAAA 

CATCT CCT CGCAAAAATTGTT CATTTTTT AAAATACAATAGTTTC 

TC CCTATATT C CCAAAT ATAGAGAAG CGG CAG CTACTTTTAATGAGGATG 

GTATTAGTTTAACTT CTGAITT'IUT'AAGC CATACATGTACGATTG AAACT 

G CAAAACTAATTTTTAAAGAAGGT AAAAT CTTAT CAG CAGTTAAAG C CTT 
TAATAAGCCTGCTGAAGTACTGGTAAAAGATAAGAGGAATGCTGCTGGAG 
ACCCTAAAGATTACTTTGACTATGTGATGTTGAACTGGTCAA^ 
TCTC^TTATCGTTTAGTAATGGAAAGATTGTTAGGCAAAGCACCATCTGA 

ACAGGAGTTAACAGTAGX3TTTTAAGCCAG 

AT CAAGATAT CATCAATCATCCTG ATT OTATTTTTGATGGTTATCAT C CT 
G CTAAAATTAAAAAT CAG CTTT CTTTAGCAGAACATTTAGTTGCATGTGT 
TATC C CAAAACATTAT CAAG AAGATTAT CAAAG CCTTGTG CCCAATG ACT 
TGAAACACAGGGTTTATTATTTAGATTACTGTAAC 
TGGAATC^AAAAGTTTATGATTTTCTTTGTCATTTGGAA 

SEQ ID NO. 6209 
STRAIN CJB110 

TTGCTGGATTATCCT CGAATTAAGG C 

GTTTGAATTGGAAAGGATAGGAGCTTT CATAG CTT ACGAGAAACAATAT A 
AAAGAAAAATTGAGATACAATGTGACGATAAACATCTCCTCACAAAAATT 

GTTCATTTTTTAAAATACAATAGTTTTACT^ 

TAGAGAAGCGGCAGCTACTTTTAATGAGGATGGTATTAGTTTAACT 
ATTTTTT AAG CCATACATGT ACGATTGAAACTG CAAAACTAATT^ 
GAAGGTAAAATCTTAT CAG CAGTTAAAG CCTTTAATAAG C CTGCTGAAGT 
ACTGGTAAATGATAAGAGGAATGCTGCTGGAGACCCTAAAGATTACTTTG 
ACTATGTGATGTTGAACTGGTCAAATACCAATT CTGGTTAT CGTTTAGT A 
ATGG AAAGATTGTTAGG CAAAG CAC CATCTGAACAGG AGTTAACAGT AG C 
TTTTAAGCCAGGGGT CAGC!TTT CATTTTAATTATCAA.GATAT CATCAATC 
ATC CTGATTCTATTTTTGATGGTTATCAT C CTGCTAAAATTAAAAAT CAA 
CTTTCTTTAG CAGAACATTTAGTTG CATGTGTTAT CCCAAAACATTAT CA 
AGAAGATTAT CAAAG C (CTTGTG CCTAATGACTTG AAACACAGAGTTTA^ 
ATTTAGATTACTGTAACGAAACACTTTATGAGTGGAATCAAAAAGTTTAT 

GATTTTCTTTGTCATTTGGAAAATAAA 
SEQ ID NO. 6210
STRAIN 1169NT 

AATTAAGGCGTTTGAATTGGAAAGGATAGGAGCTrTTCATAGCTTACGAGA 
AACAATATAAAAGAAAAACTGAGATACAATGTGA CGATAAACAT CT CCTC 
GCAAAAATTGTTCATTTTTTAAAATAGAATAGTTT^ 

T C C CAAATATAG AGAAG CG G CAGCTACTTTTAATG AGGATGGTATTAGTT 
TAACTTCTG ATTTTTTAAG C CAT ACAT GTACG ATTGAAACTG CAAAACTA 
ATTTTTAAAGAAGGTAAAAT CTTAT CAG CAGTTAAAG C CTTTAAT AAGC C 
TGCTGAAGTACTGGTAAATGATAAGAGGAATGCTGCTGGAGACCCTAAAG 
ATTACT/TTGACTATGTGATGTTGAACTGGTCAAATACCAATTCTGGTTAT 
CGTTTAGTAATGGAAAGATTGTTAGGCAAAGCACCATCTGAACAGGAGTT 
AACAGTAGGTTTTAAGCCAGGGGTCAGCTTTCATTTTACTTATCAAGATA 
TCAT CAATCAT C CTGATTCTATTTTTGATGGTTATCATCCTG CTAAAATT 
AAAAAT CAG CTTT CTTTAG CAGAACATTT AGTTG CGTGTGTT AT C CCAAA 
ACATTAT CAAGAAG ATT AT CAAAAT CTTGTGCCCAATGACITGAAACACA 
GAGTTTATTATTTAGATTACTGTAACGAAACACTTTAT^ 
AAAGTTTATG ATTTT CTTTGT CATTTGGAAAATAAA 

SEQ ID NO. 6211 
STRAIN JM9130013 

ATAGGAGCTTTCATAGCTTACGAGAAACAATATAAAAGAAAAATTGAGAT 
ACAATGTGACGATAAACAT CTC CT CACAAAAATTGTTCAlTTTT''i' AA^AT 
ACAATAGTTTTACTTTTCCCTATATT CCCAAATATAGAGAAG CGGCAG CT 
ACTTTTAATG AGGATGGTATTAGTTTAACTT CTGA1TTT"1T'AAG C CATAC 
ATGTACGATTGAAACTGCAAAACTAATTTTTAAAGAAC^TAAAATCTTAT 
CAGCAGTTAAAGCCTTTAATAAGCCTGCTGAAGTACTGGTAAATGATAAG 
AGGAATGOTGCTGGAGACCCTAAAGATTACTTTGAOTATGTGATGTTGAA 
CTGGTCAAAT ACCAATT CTGGTTAT CGTTTAGTAATGGAAAGATTGTTAG 
GCAAAGCAC CATCTG aACAGGAGTTAACAGTAGCTTTTAAGC CAGGGGTC 
AG CTTTCATTTTAATTATCA^GATATCATCAATCATC CT 
TGATGGTTATCAT C CTG CTAAAATTAAAAATCAACITCTCTTTAG CAG AAC 
ATTTAGTTGCATGTGCTATCCCAAAACATTATCAAGAAGATTATCAAAGC 
CTTGTGCCTAATGACTTGAAACACAGAGTTTAT^AT^TAGATTACTGTAA 
CGAAACACTTTATGAGTGGAAT CAAAAAGTTTATGATTTTCTTT'GTCATT 
TGGAAAATAAA 
                            1                                                   50
msa185284.2{271_090}
msa185284.2{271_H36B}
msa185284.2{271_JM9130013}
msa185284.2{271_A909}
msa185284.2{271_CJB110}
msa185284.2{271_18RS21}
msa185284.2{271_2603}       atgattttaa aaatttgtcg tgcagcatat agtttacaat ggggaggtgt
msa185284.2{271_M732}
msa185284.2{271_M781}
msa185284.2{271_COH1}
msa185284.2{271_1169NT}
Consensus                   ********** ********** ********** ********** **********

                            51                                                 100
msa185284.2{271_090}                          tgg attatcctct aattaaggcg tttgaattgg
msa185284.2{271_H36B}                                          ttaaggcg tttgaattgg
msa185284.2{271_JM9130013}
msa185284.2{271_A909}                     TTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_CJB110}                   TTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_18RS21}                   TTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_2603}       ttaccaatta gctTTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_M732}                     TTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_M781}                     TTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_COH1}                     TTGCtgg attatcctcg aattaaggcg tttgaattgg
msa185284.2{271_1169NT}                                      aattaaggcg tttgaattgg
Consensus                   ********** *******

                            101                                                150
msa185284.2{271_090}        aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAtT
msa185284.2{271_H36B}       aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAtT
msa185284.2{271_JM9130013}       ATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAtT
msa185284.2{271_A909}       aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAtT
msa185284.2{271_CJB110}     aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAtT
msa185284.2{271_18RS21}     aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAcT
msa185284.2{271_2603}       aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAcT
msa185284.2{271_M732}       aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAcT
msa185284.2{271_M781}       aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAcT
msa185284.2{271_COH1}       aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAcT
msa185284.2{271_1169NT}     aaaggATAGG AGCTTTCATA GCTTACGAGA AACAATATAA AAGAAAAAcT
Consensus                        ***** ********** ********** ********** ********_*

                            151                                                200
msa185284.2{271_090}        GAGATACAAT GTGACGATAA ACATCTCCTC aCAAAAATTG TTCATTTTTT
msa185284.2{271_H36B}       GAGATACAAT GTGACGATAA ACATCTCCTC aCAAAAATTG TTCATTTTTT
msa185284.2{271_JM9130013}  GAGATACAAT GTGACGATAA ACATCTCCTC aCAAAAATTG TTCATTTTTT
msa185284.2{271_A909}       GAGATACAAT GTGACGATAA ACATCTCCTC aCAAAAATTG TTCATTTTTT
msa185284.2{271_CJB110}     GAGATACAAT GTGACGATAA ACATCTCCTC aCAAAAATTG TTCATTTTTT
msa185284.2{271_18RS21}     GAGATACAAT GTGACGATAA ACATCTCCTC gCAAAAATTG TTCATTTTTT
msa185284.2{271_2603}       GAGATACAAT GTGACGATAA ACATCTCCTC gCAAAAATTG TTCATTTTTT
msa185284.2{271_M732}       GAGATACAAT GTGACGATAA ACATCTCCTC gCAAAAATTG TTCATTTTTT
msa185284.2{271_M781}       GAGATACAAT GTGACGATAA ACATCTCCTC gCAAAAATTG TTCATTTTTT
msa185284.2{271_COH1}       GAGATACAAT GTGACGATAA ACATCTCCTC gCAAAAATTG TTCATTTTTT
msa185284.2{271_1169NT}     GAGATACAAT GTGACGATAA ACATCTCCTC gCAAAAATTG TTCATTTTTT
Consensus                   ********** ********** ********** _********* **********

                            201                                                250
msa185284.2{271_090}        AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_H36B}       AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_JM9130013}  AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_A909}       AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_CJB110}     AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_18RS21}     AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_2603}       AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_M732}       AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_M781}       AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_COH1}       AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
msa185284.2{271_1169NT}     AAAATACAAT AGTTTTACTT TTCCCTATAT TCCCAAATAT AGAGAAGCGG
Consensus                   ********** ********** ********** ********** **********

                            251                                                300
msa185284.2{271_090}        CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_H36B}       CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_JM9130013}  CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_A909}       CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_CJB110}     CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_18RS21}     CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_2603}       CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_M732}       CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_M781}       CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_COH1}       CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
msa185284.2{271_1169NT}     CAGCTACTTT TAATGAGGAT GGTATTAGTT TAACTTCTGA TTTTTTAAGC
Consensus                   ********** ********** ********** ********** **********

                            301                                                350
msa185284.2{271_090}        CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_H36B}       CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_JM9130013}  CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_A909}       CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_CJB110}     CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_18RS21}     CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_2603}       CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_M732}       CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_M781}       CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_COH1}       CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
msa185284.2{271_1169NT}     CATACATGTA CGATTGAAAC TGCAAAACTA ATTTTTAAAG AAGGTAAAAT
Consensus                   ********** ********** ********** ********** **********

                            351                                                400
msa185284.2{271_090}        CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAtG
msa185284.2{271_H36B}       CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAtG
msa185284.2{271_JM9130013}  CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAtG
msa185284.2{271_A909}       CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAtG
msa185284.2{271_CJB110}     CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAtG
msa185284.2{271_18RS21}     CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAaG
msa185284.2{271_2603}       CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAaG
msa185284.2{271_M732}       CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAaG
msa185284.2{271_M781}       CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAaG
msa185284.2{271_COH1}       CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAaG
msa185284.2{271_1169NT}     CTTATCAGCA GTTAAAGCCT TTAATAAGCC TGCTGAAGTA CTGGTAAAtG
Consensus                   ********** ********** ********** ********** ********_*

                            401                                                450
msa185284.2{271_090}        ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_H36B}       ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_JM9130013}  ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_A909}       ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_CJB110}     ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_18RS21}     ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_2603}       ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_M732}       ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_M781}       ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_COH1}       ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
msa185284.2{271_1169NT}     ATAAGAGGAA TGCTGCTGGA GACCCTAAAG ATTACTTTGA CTATGTGATG
Consensus                   ********** ********** ********** ********** **********



451 



500 



Table 62: Comparative Sequences relating to SAG0690



msal85284 

msal85284 . 
msal85284.2{271 

msal85284 . 
msal85284 .2 f 
msal85284 . 2 { 

msal85284 . 

msal85284. 

msal85284 . 

msal85284 . 
msal85284.2{ 



.2{271_090 
2{271_H3 6B 
_JM9130013 : 
2{271_A909 
271_CJB110} 
271_18RS2l} 
2f271_2 603} 
2{271_M732} 
2{271_M781) 
2{271_COHl} 
271_1169NT} 
Consensus 



msal85284 

msal85284 . 
msal85284.2{271 

msal85284 . 
msal85284 .2{ 
msal85284 .2{ 

msal85284 . 

msa!85284 . 

msal85284 . 

msal85284 . 
msal85284 .2{ 



.2{271_090 
2{271_H36B 
__JM9130013 ! 
2{271_A909 ! 
271_CJB110 
271_18RS21 
2{271_2603 
2{271_M732 
2{271_M781 
2{271_COHl ! 
271JL169NT 
Consensus 



TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 
TTGAACTGGT 



CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 
CAAATACCAA 



TTCTGGTTAT 
TT CTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 
TTCTGGTTAT 



********** ********** ********** 



TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 

TGGAAAGATT 
********** 

550 

TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 
TTTAAGCCAG 

********** ********** ********** ********_* ********** 



501 

GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 
GTTAGGCAAA 



GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 
GCACCATCTG 



AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 
AACAGGAGTT 



CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 

CGTTTAGTAA 
********** 



AACAGTAGcT 
AACAGTAGcT 
AACAGTAGcT 
AACAGTAGcT 
AACAGTAGcT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 
AACAGTAGgT 



msal85284 

msal85284 . 
msal85284.2{271 

msal85284 . 
msal85284 ,2{ 
msal85284 -2{ 

msal85284 . 

msal85284 . 

msal85284 . 

msal85284. 
msa!85284 .2{ 



.2{271_090} 
2{271__H36B} 
_JM9130013 
2{271_A909 
271_CJB110 
271_18RS21 
2{271_2603 
2(271_M732 
2{271_M781 
2{271_C0H1; 
271_1169NT/ 
Consensus 



msal85284.2{271_090 
msal85284.2{271_H36B 
tnsal85284.2{271_JM9130013] 
msal85284.2{271_A909" 
msal85284.2{271_CJB110 
msal85284 .2{271__18RS21 
msal85284 . 2 { 271_2603 ; 
msal85284 . 2 { 271_M732 } 
msal85284.2{271_M781 
msal 85284.2(27 l_COHl 
msal85284.2{271_1169NT 

Consensus 



msal85284 .2{271_090] 
msa!85284.2{271_H36B 
msal85284.2{271_JM9130013 
msal85284.2{271_A909 
msal85284-2{271_CJB110| 
msal85284 .2{271_18RS21 
msal85284 .2{271_2603 

271 M732 



msal85284 .2 






msal85284.2(271_M781 
msal85284.2{271_COHl 
msal85284 . 2 {271__1169NT} 

Consensus 



msal85284.2{271_090; 
msal85284.2{271_H36B 
msal85284.2{271_JM9130013 
rasal85284.2{271_A909 
msal85284 .2{271_CJB110' 
msal85284 . 2 { 271_18RS21 
msal85284 .2l271_2603/ 
msal85284.2i271_M732; 
msal85284.2(271_M781 
msal85284 .2{271_COHl} 
msal85284.2{271_1169NT} 

Consensus 



551 

GGGTCAGcTT 

GGGTCAGcTT 

GGGTCAGcTT 

GGGTCAGcTT 

GGGTCAGcTT 

GGGTCAGtTT 

GGGTCAGtTT 

GGGTCAGtTT 

GGGTCAGtTT 

GGGTCAGtTT 

GGGTCAGcTT 
*******_** 

601 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 

ATTTTTGATG 
********** 

651 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 

AGAACATTTA 
********** 

701 

AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAgcCTTGT 
AAAatCTTGT 
***_ _***** 



TCATTTTAaT 

TCATTTTAaT 

TCATTTTAaT 

TCATTTTAaT 

TCATTTTAaT 

TCATTTTAcT 

TCATTTTAcT 

TCATTTTAcT 

TCATTTTAcT 

TCATTTTAcT 

TCATTTTAcT 
********_* 



GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 

GTTATCATCC 
********** 



TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
TATCAAGATA 
********** 



TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 

TCATCAATCA 
********** 



TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 

TGCTAAAATT 
********** ********_* 



AAAAATCAaC 
AAAAATCAaC 
AAAAATCAaC 
AAAAATCAaC 
AAAAATCAaC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 
AAAAATCAgC 



TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 
TTATCCCAAA 



GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCaTGTG 

GTTGCgTGTG 
*****^**** ********** 



GCCtAATGAC 

GCCtAATGAC 

GCCtAATGAC 

GCCtAATGAC 

GCCtAATGAC 

GCCcAATGAC 

GCCcAATGAC 

GCCcAATGAC 

GCCcAATGAC 

GCCCAATGAC 

GCCcAATGAC 
***_****** 



TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 

TTGAAACACA 
********** 



ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
ACATTATCAA 
********* * 



GaGTTTATTA 
GaGTTTATTA 
GaGTTTATTA 
GaGTTTATTA 
GaGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GgGTTTATTA 
GaGTTTATTA 
*_******** 



600 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 

TCCTGATTCT 
********** 

650 

TTTCTTTAGC 

TTT CTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 

TTTCTTTAGC 
********** 

700 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 

GAAGATTATC 
********** 

750 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 

TTTAGATTAC 
********** 



msal85284 

msal85284 . 
msal85284 .2(271 

msal85284 . 
msal85284 .2{ 
msal85284 .2 { 

msal85284 . 

msal85284 . 

msal85284 . 

msal85284 . 
msal85284 .2{ 



.2{271_090} 
2{271_H36B} 
_JM9130013} 
2{271_A909) 
271__CJB110} 
271_18RS2l} 
2{271_2603} 
2f 271_M732} 
2 { 27 1_M78 1 } 
2{271_COHl} 
271_1169NT} 
Consensus 



msal852S4 

msal85284 . 
msal85284.2{271 

msal85284 . 
msal85284 
msal85284 

msaX8S284 . 

msal85284 . 

tnsal85284 . 

msal85284 . 
msal85284 .2{ 



.2{271_090 
2{271_H36B 
JM9130013 



.2{271_A909 
.2f 271_CJB110 
.2{271_18RS21) 
2{271_2603} 
2{271_M732} 
2{271_M781> 
2{271__COHl} 
271_1169NT} 
Consensus 



751 

TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 
TGTAACGAAA 



CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 
CACTTTATGA 



GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 
GTGGAATCAA 



AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 
AAAGTTTATG 



800 

ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 
ATTTTCTTTG 



********** ********** ********** ********** ********** 



801 

tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
tCATTTGGAA 
nCATTTGGAA 
tCATTTGGAA 
gCATTTGGAA 
tCATTTGGAA 



816 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 
AATAAA 



».********* ****** 



SEQ ID NO. 6212 

STRAIN 2603 frame: 1 

MILKICRAAYSLQWGGVYQLALLDYPRIKAFELERIGAPIAYEKQYKRKTEIQCDDKHLL
AKIVHFLKYNSFTFPYIPKYREAAATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSA
VKAFNKPAEVLVKDKRNAAGDPKDYFDYVMLNWSNTNSG^

FKPGVSFHFTYQDIINHPDSIFDGYHPAKIKNQLSIAEHLVACVIPKHYQEDYQSLVPND
LKHRVYYLDYCNETLYEWNQKVYDFLCHLENK 

SEQ ID NO. 6213

STRAIN A909 frame: 1 

LLDYPRIKAFELERIGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYR 
EAAATFNEBGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVNDKRNAAGD 
PKIDYFDYVMLNWS^m^SGYRLWERLLGKAPSEQELTVAFKPGVSFHFNYQDIINHPDSI
FDGYHPAKIKNQLSIAEHLVACVIPKHYQEDYQSLVPNDEKHRVYYLDYCNETLYEWNQK
VYDFLCHLENK 

SEQ ID NO. 6214 
STRAIN H36B frame: 3 

KAFELERIGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYREAAATFN
EDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVNDKRNAAGDPKDYFDY
VMLNWSNTNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQDIINHPDSIFDGYHPA
KIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQKVYDFLCH
LENK 

SEQ ID NO. 6215 
STRAIN 18RS21 frame: 1 

LLDYPRIKAFELERIGAFIAYEKQYKRKTEIQCDDKHLLAKIVHFLKYNSFTFPYIPKYR
EAAATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVKDKRNAAGD
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVSFHFTYQDIINHPDSI
FDGYHPAKIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK
VYDFLCHLENK 

SEQ ID NO. 6216

STRAIN M732 frame: 1 

LLDYPRIKAFELERIGAFIAYEKQYKRKTEIQCDDKHLLAKIVHFLKYNSFTFPYIPKYR
EAAATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVKDKRNAAGD
PKDYFDYWLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVSFHFTYQDIINHPDSI
FDGYHPAKIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK
VYDFLXHLENK 

SEQ ID NO. 6217

STRAIN COH1 frame: 1 

LLDYPRIKAFELERIGAFIAYEKQYKRKTEIQCDDKHLLAKIVHFLKYNSFTFPYIPKYR
EAAATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVKDKRNAAGD
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVSFHFTYQDIINHPDSI
FDGYHPAKIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK
VYDFLWHLENK 

SEQ ID NO. 6218

STRAIN M781 frame: 1 
LLDYPRIKAFELERIGAFIAYEKQYKRKTEIQCDDKHLLAKIVHFLKYNSFTFPYIPKYR
EAAATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVKDKRNAAGD
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVGFKPGVSFHFTYQDIINHPDSI
FDGYHPAKIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK
VYDFLCHLENK 

SEQ ID NO. 6219 

STRAIN CJB110 frame: 1

LLDYPRIKAFELERIGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYR
EAAATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVNDKRNAAGD
PKDYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQDIINHPDSI
FDGYHPAKIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQK
VYDFLCHLENK 

SEQ ID NO. 6220 

STRAIN 1169NT frame: 2 

IKAFELERIGAFIAYEKQYKRKTEIQCDDKHLLAKIVHFLKYNSFTFPYIPKYREAAATF
NEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVNDKRNAAGDPKDYFD
YVMLNWSNINSGYRLVMERLLGKAPSEQELWGFKPGVSFHFTYQDIINHPDSIFDGYHP
AKIKNQLSLAEHLVACVIPKHYQEDYQNLVPNDLKHRVYYLDYCNETLYEWNQKVYDFLC
HLENK 

SEQ ID NO. 6221 
STRAIN JM9130013 frame: 1 

IGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYREAAATFNEDGISLT
SDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVNDKRNAAGDPKDYFDYVMLNWSN
TNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQDIINHPDSIFDGYHPAKIKNQLS
LAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQKVYDFLCHLENK 

SEQ ID NO. 6222 
STRAIN 090 frame: 3 

DYPLIKAFELERIGAFIAYEKQYKRKIEIQCDDKHLLTKIVHFLKYNSFTFPYIPKYREA
AATFNEDGISLTSDFLSHTCTIETAKLIFKEGKILSAVKAFNKPAEVLVNDKRNAAGDPK
DYFDYVMLNWSNTNSGYRLVMERLLGKAPSEQELTVAFKPGVSFHFNYQDIINHPDSIFD
GYHPAKIKNQLSLAEHLVACVIPKHYQEDYQSLVPNDLKHRVYYLDYCNETLYEWNQKVY
DFLCHLENK 



PRETTY of: /biotmp/msal85358.2{*} May 13, 2003 07:11



msal85358 
rasal853S8 .2(271 

msal85358 . 

msal85358 . 
msal85358 .2{ 
msal85358 .2{ 
msal85358 .2{ 

msal85358 . 

rasal85358 . 

msal85358 . 

msa!85358 . 



.2{271_090"- 
_JM9130013 
2{2711H36B 
2{271_A909; 
271_CJB110 
271_1169NT 
271_18RS21 ; 
2{271_2603 
2{271_M732 
2{271_M78l} 
2{271_C0H1} 
Consensus 



msal85358 
msal85358 .2(271 

msal85358 . 

msal85358 . 
rasal85358 .2( 
msal85358 .2( 
msal85358 .2( 

msal85358 . 

nisal85358 . 

msal85358 . 

rasal85358 . 



.2{271_090 
_JM9130013 
2(271_H36B 
2{271_A909; 
271_CJB110 
271_1169NT 
271_18RS21 
2{271_2603; 
2(271_M732} 
2{271_M781> 
2(271_COHl} 
Consensus 



msal85358 
msal85358 .2(271 

msa!85358 . 

msal85358 . 
nisal85358 .2{ 
msal85358 .2{ 
msa!85358 .2{ 

msal85358 . 

msal85358 . 

msal85358 . 

msal85358 . 



2{271_090 
_JM9130013 
2{271_H36B 
2{271_A909 
271_CJB110} 
271_1169NT} 
271_18RS21} 
2(271_2603) 
2{271_M732} 
2(271_M78lJ 
2{271_C0H1} 



1 

dyplika felerlGAFI 

IGAFI 

ka felerlGAFI 

-LLdyprika felerlGAFI 

-LLdyprika felerlGAFI 

ika felerlGAFI 

-LLdyprika felerlGAFI 

milkicraay slqwggvyql aLLdyprika felerlGAFI 

-LLdyprika felerlGAFI 

-LLdyprika felerlGAFI 

• LLdyprika felerlGAFI 

********** ********** *** ***** 



51 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 

EIQCDDKHLL 
********** 

101 

HTCTIETAKL' 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 

HTCTIETAKL 



tKIVHFLKYN 

t KI VHFLKYN 

tKIVHFLKYN 

tKIVHFLKYN 

tKIVHFLKYN 

aKI VHFLKYN 

a KI VHFLKYN 

aKI VHFLKYN 

aKI VHFLKYN 

aKI VHFLKYN 

aKI VHFLKYN 
_********* 



I FKEGKI LSA 
I FKEGKI LSA 
I FKEGKI LSA 
I FKEGKI LSA 
IFKEGKILSA 
I FKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 
IFKEGKILSA 



SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 
SFTFPYIPKY 



REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 
REAAATFNED 



********** ********** 



VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 
VKAFNKPAEV 



LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVnDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 
LVkDKRNAAG 



50 

AYEKQYKRKI 
AYEKQYKRKI 
AYEKQYKRKI 
AYEKQYKRKI 
AYEKQYKRKI 
AYEKQYKRKt 
AXEKQYKRKt 
AYEKQYKRKt 
AYEKQYKRKt 
AYEKQYKRKt 

AYEKQYKRKt 
*********_ 

100 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 

GISLTSDFLS 
********** 

150 

DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 
DPKDYFDYVM 






WO 2004/018646 



PCT/US2003/026827 






Consensus 



********** ********** ********** **-******* ********** 



msal85358 
msal853 58 .2(271 

msal85358 . 

msal85358 . 
msal85358 . 2{ 
msal85358 
msal85358 

msal85358 . 

msal85358 . 

msal85358 . 

msal85358 . 



.2{271_090 
_JM9130013 
2{271_H36B 
2{271_A909 
. X 271_CJB110 
.2i271_1169NT 
.2{271_18RS21 
2{271_2603 
2{271_M732 
2l271_M781 
2{271_COHl 
Consensus 



msal85358 
msal85358 .2(271 

msal85358 . 

msal85358 . 
msal85358 .2( 
msal853S8 . 2{ 
msal85358 . 2{ 

msal85358 . 

msal85358 . 

msal85358 . 

msal8S358 . 



.2{271_090 
_JM9130013 
2{271_H36B 
2(271_A909 
271_CJB110 
271_1169NT 
271_18RS21 
2(271_2603 
2{271_M732 
2(271_M781 
2(271_COHl} 
Consensus 



151 

LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 
LNWSNTNSGY 



RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 
RLVMERLLGK 



APSEQELTVa 
APSEQELTVa 
APSEQELTVa 
APSEQELTVa 
APSEQELTVa 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 
APSEQELTVg 



FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFn 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 
FKPGVSFHFt 



********** ********** *********_ *********- 



201 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 

I FDGYHPAKI 
********** 



KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 
KNQLSLAEHL 



VACVIPKHYQ 
VACVI PKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 
VACVIPKHYQ 



EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQnLVPND 
EDYQSLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 
EDYQsLVPND 



********** ********** ****_***** 



200 

YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
YQDIINHPDS 
********** 

250 

LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 
LKHRVYYLDY 

LKHRVYYLDY 
********** 



msal85358 
msal85358 .2(271 
msal85358 . 
msa!85358 . 
msal85358 .2f 
' msal85358.2( 
msal85358.2( 
msal85358 . 
msal85358 . 
msal85358 . 
msal85358 . 



.2{271_090} 
_JM9130013} 
2(271_H36B} 
2(271_A909} 
271_CJB110> 
271_1169NT) 
271_18RS2l} 
2(271_2603} 
2{271_M732} 
2(271_M78l} 
2{271_COHl} 
Consensus 



251 

CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 
CNETLYEWNQ 



272 

KVYDFLcHLE NK 
KVYDFLcHLE NK 
KVYDFLCHLE NK 
KVYDFLcHLE NK 
KVYDFLcHLE NK 
KVYDFLcHLE NK 
KVYDFLCHLE NK 
KVYDFLcHLE NK 
KVYDFLxHLE NK 
KVYDFLcHLE NK 
KVYDFLwHLE NK 



********** ******_*** ** 
Table 63: Comparative Sequences relating to SAG1912



SEQ ID NO. 6301 
STRAIN 2603 

ATGAAAAGTCGAAAAAAAGATAAATTGGTATTGAGGTTAAC7VACAACACTATTGGTTTTT 
GGTTTGGGTGGGGTTTGGTTTTATAATTATAAAAATGATAATGTCGAACCGACAGTCACT 
AGTG CAT CGG AT CAAACGACGACTTTTATT CAAAC GATTT CT C CAACAG CTATTG AAATT 
TCTAAGACCT ATGATTTGT ATG CGT CAGTCCT ATTAG CACAAGCT ATTT TGGAAT CATC C 
AGTGGACAAT CAGATTTGT CTAAGG CT C CT AATTATAAC CT CTTTGG CAT CAAAGGAGAA 
T ATAAAGGT AAAT CTGT C CAAATG C CT ACTTTAG AAGATGAT GGG AAAGG CAAT ATGACT 
CAAAT C CAAG CT CCTTTTCG CG CCT AT CCAAATTATTCTG CTTCACT AT ATGAT TATG CT 
GAGTTAGTATCT AGT CAAAAGTATG CATCTGTTTGGAAAT CAAAT AC CT CTT CTTAT AAG 
GATG CTACTG CAG CT CT AACAGGT C TTTATG CGACAG ATACTG CTTATG CTAGT AAATT A 
AACCAAATTATTGAAAC CTACAGTCTAGATG CTTATGAT AAA 

SEQ ID NO. 6302
STRAIN 090 

GGGGTTTGGTTTTATAATTATAA • 

AAATGATAATGTCGAACCGACAGTCACTAGTGCATCGGATCAAACGACGA 
CTTTTATTCAAACGATTT CTCCAA.CAG CT ATTGAAATTTCTAAGAC CTAT 
GATTTGTATGCGT CAGT CTTATTAG CACAAGCTATTTTGGAATCAT C CAG 
TGG ACAATCAGATTTGT CTAAGGCT CCTAATT ATAAC CT CTTTGG CATCA 
AAGG AGAATATAAAGGT AAAT CTGT C CAAATGCCTACTTTAGAAGATGAT 
GGGAAAGGCAAT AT GACTCAAATC CAAG CT CCTTTTCGCG CCTAT CCAAA 
TTATTCTGCTTCACTATATGATTATGCTGAGTTAGT^^ 
ATG CATCTGTTTGG AAATCAAATACCTCIT CTTATAAG^ CTACTG CA 
G CTCTAACAGGT CTTTATG CGACAGATACTG CTT ATG CTAGTAAATT AAA 
CCAAATTATTGAAAC CTACAGT CTAGATG CTTATGATAAA 

SEQ ID NO. 6303 
STRAIN A909 

GGGGTTTGGTTTTATAATTATAA 

AAATGAT AATGT CGAAC CGACAGT CACTAGTG CATCGGAT CAAACGACGA 
CTTTTATTCAAACGATCTCTCCAACAGCT 

GATTTGTAT G CGT CAGT CTTAT TAG CACAAGCTATTTTGGAATCAT C CAG 
TGGACAATCAGATTTGTCTAAGGCTCCTAATTATAACCT CTTTGG 
AAGGAGAATATAAAGGTAAATCTGTCCAAATGCCTACTTTAGAAGATGAT 
GGGAAAGGCAATATGACTCAAATCCAAGCTCCTTTTCGCGCCTATCCAAA 
TTATTCTGCITCACTATATGATTATGCT 

ATGCATCTGCTTGGAAATCA^ATACTTCTTCTTATAAGGATGCTACrGCA 
G CT CTAACAGGT CTTTATG CGAt^GATACTG CTT ATG CTAGTAAATTAAA 
CCAAATTATTGAAAC CTACAGT CTAGATG CTTATG ATAAA 

SEQ ID NO. 6304 

STRAIN H36B 

GGGGTTTGGTTTTATAATTATAAAAATGATA 

ATGTCGAACCGACAGTCACTAGTGCATCGGATCAAACGACGACTTTTATT 
CAAACC4ATTTCTCCAACAGCTATTGAAATTTCT 

TGCGT CACTCTTATT AGCACAAGCTATTTTGGAAT CATC CAGTGGACAAT 
CAGATTTGT CTAAGGCT CCTAATTATAACCT CTTTGG CAT CAAAGGAGAA 
TATAAAGGTAAATCTGTCCAAATGCCTACTCTAGAAGATGATGGGAAAGG 
CAATATGACT CAAAT CCAAG CT CCTTTT CG CG C CTAT CCAAATTATTCTG 
CTTCACTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGTaTGC^ 
GCTTGGAAAT CAAATACTT CTT CTTAT AAGGATG CTACTG CAGCTCTAAC 
AGGT CTITTATGCGACAGATACTGCTTATG CTAGT AAATTAAACCA^ 
TTGAAACCTACAGT CTAGATG CTT ATG ATAAA 

SEQ ID NO. 6305 

STRAIN 18RS21 

GGGGTTTGGTTTTATAATTATAAAAAT GAT AATG 

TCGAACCGACAGTCACTAGTGCATCGGATCAAACGACGACriTT^ 
ACGATTTCT C CAACAGCTATTG AAATTTCTAAGAC CTATG ATTTGTATG C 
GTCAGTCTT ATT AG CACAAG CTATTTTGG AATCAT C CAGTGGACAAT CAG 
ATTTGT CTAAGGCT CCT AATTATAACCTCTTTGG CATCAAAGGAGAATAT 
AAAGGTAAATCTGTCCAAATGC CTACTTTAGAAGATGATGGG AAAGG CAA 
TATG ACT CAAAT CCAAG CT C CTTTT CG CG C CT ATC CAAATTATTCTG CTT 
CACTATATGATTATG CTGAGTT AGT AT CTAGT CAAAAGTATG CAT CTGTT 
TGGAAATCAAAT ACCT CTT CTTAT AAGGATG CTACTG CAGCT CT AACAGG 
TCITTATG CGACAGATACT GCTTATGCTAGTAAATTAAAC CAAATTAT^ 
AAAC CTACA.GT CTAGATGCTTATGAT AAA 

SEQ ID NO. 6306 
STRAIN M732 

GGGGTTTGGTTTTATAATTATAA 

AAATGAT AATGTCGAAC CGACAGT CACTAGTG CATCGGAT CAAACGACGA 
CTTTTATT CAAACGATTT CT C CAACAG CTATTGAAATTT CTAAGACCTAT 
GATTTGT ATG CGTCAGT CTTATTAG CACAAG CTATTTTGG AATCATC CAG 
TGGACAAT CAGATTTGT CT AAG G CT C CTAATT AT AACCTCTT TGG CATCA 
AAGGAGAAT ATAAAGGTAAATCIGT CCAA^TG CCT ACTTTAG AAGATGAT 

GGGAAAGGCAATATGACT CAAATC CAAG CT CCTTTTCGCG C CTATCCAAA 
TTATTCTGCTTCACTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGT 
ATGCATCTGTTTGGAAAT CAAATACTT CTT CTTAT AAGGATG CTACT 
G CT CT AACAGGT CTTTATG CGACAGATACTG CTT ATG CTAGTAAATT AAA 
CCAAATTATTGAAAC CTACAGT CTAGATG CTTATGATAAA 



SEQ ID NO. 6307 
STRAIN COH1 

GGGGTTTGGTTTTATAATTATAA 

AAATGATAATGTCX3AACCGACAGTCACTAGTGCATCGGATCAAACGACGA 
CTTTT ATT CAAACG ATTTCT C CAACAG CT ATTGAAATTT CTAAG AC CTAT 
GATTTGTAT G CGTCAGT CTT ATTAG CACAAG CTATTTTGGAATCATC CAG 
TGGACAATCAGATTTGT CTAAGGCT CCT AATTATAAC CT CTTTGG CATCA 
AAGGAGAATATAAAGGTAAAT CTGTCCAAATG CCT ACTTTAG AAG ATGAT 
GGGAAAGG CAAT AT GACT CAAAT C CAAGCT CCTTTT CG CG C CTAT C CAAA 
TTATTCTGCTTC^CTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGT 
ATGCATCTGTTTGGAAATCAAATACrTCTTCT^ 

GCTCTAACAGGTCTTTATGCGACAGATACTGCTTATGCTAGTAAATTAAA 
CCAAATTATTGAAACCT ACAGT CT AGATG CTT AT GAT AAA 

SEQ ID NO. 6308
STRAIN M781 

GGGGTTTGGTTTTATAATTATAAAAATGA 

TAATGTCGAACCGAC^GTCACTAGTGCATCGGATCAAACGACGACTTTTA 
TT CAAACGATTTCT C CAACAG CTATTGAAATTTCTAAGAC CT ATGATTTG 
TATG CGT CAGTCTTATT AG CACAAG CTATTTTGGAATCAT CCAGTGGACA 
ATCAGATTTGT CTAAGG CT CCTAATT ATAACCTCTTTGGCAT CAA^G GAG 
AATAT AAAGGTAAAT CTGT C CAAA.TGC CTACTTTAGAAG ATGAT GGGAAA 
GGCAATATGACTCAAATCCAAG CT CCTTTT CG CG C CTAT C CAAATTATT C 
TG CTT CACTATATGATT ATG CTGAGTT AGTAT CTAGTCAAAAGTATG CAT 
CTGTTTGGAAATCAAATACri' C'i'i' CTTATAAGGATGCTACTG CAG CT CT A 
ACAGGT CTTTATG CGACAGATACTGCTTATG CTAGTAAAITAAAC CAAAT 
TATTGAAAC CTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6309
STRAIN CJB110 

GGGGTTTGGTTTTATAATTATAAAAATGATAATGT 

CGAAC CGACAGT CACTAGTG CAT CGGATCAAACG ACG AC TTTTAT T CAAA 
CGATTTCT C CAACAG CTATT GAAATTT CTAAG AC CTATGATTTGT ATG CG 
TCAGT CTTATTAG CACAAG CTATTTTGGAAT CAT CCAGTGGACAATCAG A 
TTTGTCTAAGGCrCCTAATTATAACCTCTTTGGCATCAAAG^ 
AAGGTAAAT CTGT CCAAATG C CTACTTTAG AAGATGATGGGAAAGGCAAT 
ATGACTCAAATCCAAGCTCCITTTCGCG 

ACTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGTATGCATCTGTTT 
GGAAATCAAATACCTCTTCTTATAAGGATGCTACTGCAGCTCTAACAGGT 
CTTTATG CGACAGATACTG CTTATGCTAGTAAATTAAAC CAAATTATTG A 
AACCTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6310 
STRAIN 1169NT 

GGGGTTTGGTTTTATAATTATAAAAATGATAATGT 

CGAACAGACAGTCACTAGTGCATCGGATCAAACGACGACTTTTATT 
CGATTTCCCC^CAGCTATTGAAATTTCrAAGACCTATGATTTGTATGCG 
TCAGT CTTATTAGCACAAGCTATTTTGGAATCAT C CAGTGGACAATCAGA 
TTTGTCTAAGGCTC CTAATTATAACCTCTTTGGCATCAAAGGAGAATATA 
AAG<3TAAATCTGTCCAAATGCCTACTTTAGAAGATGATGGGAAAGGC!AAT 
ATGACTCAAATCCAAGCTCCITTTCGCGCCTAT^ 
ACTATATGATTATGCTGAGTTAGTATCTAGTCAAAAGTATGCATC^ 
GGAAATCAAATACTT CTTCTTATAAGG ATG CTACTGCAG CT CTAACAGGT 
CTTTATG CGACAGATACTG CTTATG CTAGT AAATTAAAC CAAATTATTG A 
AACCTACAGTCTAGATGCTTATGATAAA 

SEQ ID NO. 6311
STRAIN JM9130013

TTTX3GTTTTATAATTATAAAAATGATAATGT CGAAC CGACAGT CACTAGT 
G CAT CGGAT CAAACG ACGACTTTT ATT CAAACGATTT CC C CAACAG CTAT 
TGAAATTTCT AAGAC CT ATGATTTGTATGCGT CAGT CTTATTAG CACAAG 
CTATTTTGGAAT CAT CCAGTGGACAAT CAG ATTTGTCTAAGGCT C CTAAT 
TATAACCTCTTTGG CAT CAAAGGAGAATATAAAGGT AAAT CTGT T CAAAT 
GC CTACTTTAGAAG ATG ATGGGAAAGGTAATATGACC CAAATCCAAG CT C 
CTTTT CG CG CCTATC CAAATTATT CTG CTT CACTATATGATTATG CTGAG 
- TT AGTAT CTAGT CAAAAGTATG CAT CTGTTTGGAAATCAAATACCTCTT C 
TTATAAGGATGCTACTG CAG CT CTAACAGGTCTTT ATGCG ACAGATACTG 
CTTATGCTAGTAAATTAAAC CAAATTATTG AAAACTACAGTCTAGATG CT 
TATGATAAA 

PRETTY of: /biotmp/msa243324.2{*} February 11, 2003 05:11



msa243324.2{275_A909; 

maa243324 .2{275_H36B; 
msa243324 . 2 {275_090 
msa243324.2{275_18RS21 

msa243324 . 2 {275_2603 
msa243324.2{275_CJB110 

msa243324.2{275_j:OHl 

msa243324 . 2 { 275_M732 } 



atgaaaagtc gaaaaaaaga taaattggta ttgaggttaa caacaacact 



msa243324.2{275_M781 
msa243324 .2{275_1169NT 
msa243324.2{275_JM9130013 

Consensus 



msa243324.2{275_A909} 
msa243324.2{275_H36B} 
msa243324 . 2 {275_090 } 
msa243324 .2{275_18RS2l} 
msa243324.2{275_2603} 
msa243324 . 2 {275_CJB110 } 
msa243324.2f275_COHl} 
msa243324.2(275_M732} 
msa243324.2{275__M78l} 
msa243324.2{275_1169NT} 
msa243324 . 2 { 275_JM913 0013 } 

Consensus 



msa243324 . 2 {275_A909 } 
msa243324.2{275_H36B} 
msa243324 . 2 { 275_090 } 
msa243324.2{275_18RS21 
msa243324.2{275_2603 
Ttisa243324 . 2 {275_CJB110 
msa243324.2{27 5_COHl/ 
msa243324.2f 275_M732 
msa243324 .2{275_M781 
msa243324 . 2 { 275_1169NT 
msa243324-2{275_JM9130013 

Consensus 



********** ********** ********** ********** ********** 



msa243324 . 

msa243324. 
msa243324 
msa243324.2{ 

msa243324 . 
msa243324.2{ 

msa243324 . 

msa243324 . 

msa243324 . 
msa243324.2{ 
msa243324.2{275 



2{275_A909} 
2{275_H36B} 
.2{275_090} 
275_JL8RS2l} 
2{275_2603} 
275_CJB110} 
2{275_C0H1} 
2{275_M732} 
2{275_M781} 
275_1169NT} 
,_JM9130013} 
Consensus 



msa243324.2{275_A909} 
msa243324.2{275_H36B} 
msa243324.2{275_090} 
msa243324 . 2 {275_18RS21 
msa243324.2{275_2603 
msa243324.2{275_CJB110 
msa243324.2(275_COHl 
msa243324.2(275_M732 
msa243324 . 2{ 275_M781 
msa243324.2{275_1169NT 
msa243324.2{275_JM9130013 

Consensus 



msa243324.2(275__A909) 
msa243324 . 2 {275_H36B 
msa243324.2{275_090; 
msa243324 . 2 { 275_18RS21 
msa243324.2{275_2603, 
msa243324.2{275_CJB110 
msa243324.2{275_COHl 
ms a2 43 3 24 . 2 { 2 7 5_M73 2 
msa2 43 3 24 . 2 { 27 5_M78 1 } 
msa243324.2{275_1169NT} 
msa243324.2{275_JM9130013} 

Consensus 



msa243324.2{275 > _A909 

msa243324.2{275__H36B> 
msa243324.2{275_090 
msa243324 . 2 { 275_18RS21 

msa243324.2{275_2603 
msa243324.2{275_CJB110 

msa243324 . 2 { 275_C0H1 } 



51 

g gggTTTGGTT 

g gggTTTGGTT 

' ,g gggTTTGGTT 

g gggTTTGGTT 

attggttttt ggtttgggtg gggTTTGGTT 

g gggTTTGGTT 

g 

g 

g 

g 



gggTTTGGTT 
gggTTTGGTT 
gggTTTGGTT 
gggTTTGGTT 
TTTGGTT 



TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 
TTATAATTAT 



100 

AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 
AAAAATGATA 



********** *********_ ******* ********** ********** 



GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 
GACAGTCACT 



101 

ATGTCGAACc 
ATGTCGAACc 
ATGTCGAACC 
ATGTCGAACc 
ATGTCGAACc 
ATGTCGAACc 
ATGTCGAACc 
ATGTCGAACc 
ATGTCGAACc 
ATGTCGAACa 

ATGTCGAACc 
*********- 

151 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 

CAAACGATTT 
********** *_******** 



AGTG CATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 
AGTGCATCGG 



ATCAAACGAC. 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 

ATCAAACGAC 



150 

GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 
GACTTTTATT 



********** ********** ********** ********** 



CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CtCCAACAGC 
CcCCAACAGC 
CcCCAACAGC 



201 

TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 
TGCGTCAGTC 



TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 
TTATTAGCAC 



********** ********** 



TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 
TATTGAAATT 

TATTGAAATT 
********** 



AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 

AAGCTATTTT 
********** 



200 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 

ATGATTTGTA 
********** 

250 

AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 
AGTGGACAAT 

AGTGGACAAT 
********** ********** 



TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
TCTAAGACCT 
********** 



GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 
GGAATCATCC 



251 

CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 
CAGATTTGTC 

CAGATTTGTC 
********** 

301 

TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 



TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 
TAAGGCTCCT 



AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 
AATTATAACC 



TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 
TCTTTGGCAT 



********** ********** ********** 



AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 



AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 



TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 



300 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 

CAAAGGAGAA 
********** 

350 

ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 



msa243324.2{275_M732} 
msa243324.2{275_M78l} 
msa243324.2{275_1169NT} 
msa243324.2{275_JM9130013} 

Consensus 



msa243324.2{275_A909}
msa243324.2{275_H36B}
msa243324.2{275_090}
msa243324.2{275_18RS21}
msa243324.2{275_2603}
msa243324.2{275_CJB110}
msa243324.2{275_COH1}
msa243324.2{275_M732}
msa243324.2{275_M781}
msa243324.2{275_1169NT}
msa243324.2{275_JM9130013}

Consensus 



msa243324.2{275_A909}
msa243324.2{275_H36B}
msa243324.2{275_090}
msa243324.2{275_18RS21}
msa243324.2{275_2603}
msa243324.2{275_CJB110}
msa243324.2{275_COH1}
msa243324.2{275_M732}
msa243324.2{275_M781}
msa243324.2{275_1169NT}
msa243324.2{275_JM9130013}

Consensus 



msa243324.2{275_A909}
msa243324.2{275_H36B}
msa243324.2{275_090}
msa243324.2{275_18RS21}
msa243324.2{275_2603}
msa243324.2{275_CJB110}
msa243324.2{275_COH1}
msa243324.2{275_M732}
msa243324.2{275_M781}
msa243324.2{275_1169NT}
msa243324.2{275_JM9130013}

Consensus 



msa243324.2{275_A909}
msa243324.2{275_H36B}
msa243324.2{275_090}
msa243324.2{275_18RS21}
msa243324.2{275_2603}
msa243324.2{275_CJB110}
msa243324.2{275_COH1}
msa243324.2{275_M732}
msa243324.2{275_M781}
msa243324.2{275_1169NT}
msa243324.2{275_JM9130013}

Consensus 



msa243324.2{275_A909}
msa243324.2{275_H36B}
msa243324.2{275_090}
msa243324.2{275_18RS21}
msa243324.2{275_2603}
msa243324.2{275_CJB110}
msa243324.2{275_COH1}
msa243324.2{275_M732}
msa243324.2{275_M781}
msa243324.2{275_1169NT}
msa243324.2{275_JM9130013}

Consensus 



TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 
TATAAAGGTA 



AATCTGTcCA 
AATCTGTcCA 
AATCTGTcCA 
AATCTGTtCA 



AATGCCTACT 
AATGCCTACT 
AATGCCTACT 
AATGCCTACT 



TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 
TTAGAAGATG 



ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 
ATGGGAAAGG 



********** *******-** ********** ********** ********** 



351 

cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
cAATATGACt
tAATATGACc



CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 
CAAATCCAAG 



CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 
CTCCTTTTCG 



CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 
CGCCTATCCA 



400 

AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 
AATTATTCTG 



_********- ********** ********** ********** ********** 



401 

CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 
CTTCACTATA 



TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 
TGATTATGCT 



GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 
GAGTTAGTAT 



********** ********** ********** 



CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 

CTAGTCAAAA 
********** 



451 

GcTTGGAAAT 
GcTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
GtTTGGAAAT 
*_******** 

501 

AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 
AGGTCTTTAT 



CAAATACtTC 

CAAATACtTC 

CAAATACcTC 

CAAATACcTC 

CAAATACcTC 

CAAATACcTC 

CAAATACtTC 

CAAATACtTC 

CAAATACtTC 

CAAATACtTC 

CAAATACcTC 
*******_** 



TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 
TTCTTATAAG 

TTCTTATAAG 
********** 



GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 
GATGCTACTG 

GATGCTACTG 
********** 



GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 
GCGACAGATA 



CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 
CTGCTTATGC 



********** ********** ********** 



TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 
TAGTAAATTA 

TAGTAAATTA 
********** 



450 

GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
GTATGCATCT 
********** 

500 

CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
CAGCTCTAAC 
********** 

550 

AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
AACCAAATTA 
********** 



551 

TTGAAAcCTA
TTGAAAcCTA
TTGAAAcCTA
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAcCTA 
TTGAAAaCTA 



CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 
CAGTCTAGAT 



582 

GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 
GCTTATGATA AA 



******_*** ********** ********** ** 



SEQ ID NO. 6312
MKSR^ 

SKTYDLYASVLLAQAILESSSGQSDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMT
QIQAPFRAYPNYSASLYDYAELVSSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKL
NQIIETYSLDAYDK 



SEQ ID NO. 6313 
STRAIN 090 frame: 1

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6314 
STRAIN A909 frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASAWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6315 
STRAIN H36B frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASAWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6316 
STRAIN 18RS21 frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6317 

STRAIN M732 frame: 1 

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6318 

STRAIN M781 frame: 1

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6319 
STRAIN CJB110 frame: 1

GVWFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6320 
STRAIN 1169NT frame: 1

GVWFYNYKNDNVEQTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQ
SDLSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELV
SSQKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIETYSLDAYDK

SEQ ID NO. 6321 
STRAIN JM9130013 frame: 3

WFYNYKNDNVEPTVTSASDQTTTFIQTISPTAIEISKTYDLYASVLLAQAILESSSGQSD
LSKAPNYNLFGIKGEYKGKSVQMPTLEDDGKGNMTQIQAPFRAYPNYSASLYDYAELVSS
QKYASVWKSNTSSYKDATAALTGLYATDTAYASKLNQIIENYSLDAYDK

PRETTY of: /biotmp/msa243476.2{*} February 11, 2003 05:17

1 50 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

mksrkkdklv lrltttllvf glggvWFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEpTVT SASDQTTTFI 

WFYNY KNDNVEpTVT SASDQTTTFI 

gvWFYNY KNDNVEqTVT SASDQTTTFI 

********** ********** ***_-***** ******_*** ********** 

51 100 
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE 
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE 
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE 
QTISPTAIEI SKTYDLYASV LLAQAILESS SGQSDLSKAP NYNLFGIKGE 
********** ********** ********** ********** ********** 



msa243476.2{275_090}
msa243476.2{275_18RS21}
msa243476.2{275_2603}
msa243476.2{275_CJB110}
msa243476.2{275_M732}
msa243476.2{275_M781}
msa243476.2{275_A909}
msa243476.2{275_H36B}
msa243476.2{275_JM9130013}
msa243476.2{275_1169NT}

Consensus 



msa243476.2{275_090}
msa243476.2{275_18RS21}
msa243476.2{275_2603}
msa243476.2{275_CJB110}
msa243476.2{275_M732}
msa243476.2{275_M781}
msa243476.2{275_A909}
msa243476.2{275_H36B}
msa243476.2{275_JM9130013}
msa243476.2{275_1169NT}

Consensus 






msa243476.2{275_090}
msa243476.2{275_18RS21}
msa243476.2{275_2603}
msa243476.2{275_CJB110}
msa243476.2{275_M732}
msa243476.2{275_M781}
msa243476.2{275_A909}
msa243476.2{275_H36B}
msa243476.2{275_JM9130013}
msa243476.2{275_1169NT}

Consensus



msa243476.2{275_090}
msa243476.2{275_18RS21}
msa243476.2{275_2603}
msa243476.2{275_CJB110}
msa243476.2{275_M732}
msa243476.2{275_M781}
msa243476.2{275_A909}
msa243476.2{275_H36B}
msa243476.2{275_JM9130013}
msa243476.2{275_1169NT}

Consensus



101 

YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 
YKGKSVQMPT 



LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 
LEDDGKGNMT 



QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 
QIQAPFRAYP 



NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 
NYSASLYDYA 



150 

ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 
ELVSSQKYAS 



********** ********** ********** ********** ********** 



151 

vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK 
aWKSNTSSYK 
aWKSNTSSYK 
vWKSNTSSYK 
vWKSNTSSYK



DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 
DATAALTGLY 



ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 
ATDTAYASKL 



NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD 
NQIIEtYSLD
NQIIEtYSLD 
NQIIEnYSLD 
NQIIEtYSLD 



_********* ********** ********** *****_**** 



194 

AYDK 

AYDK 

AYDK 

AYDK 

AYDK 

AYDK 

AYDK 

AYDK 

AYDK 

AYDK 
**** 
Table 64: Comparative Sequences relating to SAG 0827



SEQ ID NO. 6401 
STRAIN 2603 

ATG AACAAGT CT AAG AAAAT CG AAAATT ATCAATTACTATTACTACAAG CGCAAG CT CTA 
TTCTCAGATGAAACAAATGCTCTTGCCAACTTATCAAATGCTTCAGCTATGCTAAATGCT 
ATG CTTC CAAATT CTGT ATTT ACAGGCTTTTATTT ATTTG ATGG AGAAG AGTTAATT CTT 
GGCCCITTCCAGGGTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGT 
GAATCrGCACAAACTGCTAAGAGGCTGATCGTTGAmA.TGTTACAAAGCATGCTAACTAT 
ATCT C CTGTGATT CAAAAG CTATGAGTGAAAT CGTAGTAC CTATGTTTAAAAATGG CAAA 
CTTCT AGGAGTT CTAGATTTAGATT CTTCTTTAGT AG CAGATTATGATGAGATTG AT CAA 
GAATACTTAGAAAAATTTGTAGGTATTCTAGTAGAAGATACX?^^ 
. TTTGGAGTTGAAAAG 

SEQ ID NO. 6402 
STRAIN 090 

CTCTATTCTCAGATGAAACAAATGCTCCTGCGAACTTA 

T CAAATG CTT CAGCTATG CTAAATG CT ATG CTT C CAAATT CTGT ATTTAC 

AGGCTTTTATTTATTTGATGGAAAGGAGTTAATTCTTGG CC CT1T 

GTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGTGAA 

TCTCCACAAACTGCTAAGACGCTGATTGTTGATGATGTT^ 

TAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTACCTA 

TGTTT AAAAATGG CAAACTT CTAGGAGTT CTAGATTTAGATT CTT CTTTA 

GTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTGTAGG 

TATT CTAGTAGAACATACGATTTGGAATTTGGATA 

SEQ ID NO. 6403 

STRAIN A909 

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAA 

CTTATCAAATGCTTCAG CTATGCTAAATGCTATG CTTCCAAATT CTGTAT 
TTACAGG (CTTTT ATTTACTTGATGGAG AAGAGTTAATTCIT^ C 
CAGGGTGGTGTATCATGTGTGCATACTACTTTAGGA 

TGAAT CTGCACAAACTG CTAAGACGCTGAT CGTTGATGATGTTACAAAG C 
ATGCTAACTATATCT CCTGTGATT CAAAAG CTATGAGTGAAATCGTAGTA 
CCTATGTTTAAAAATGG CAAACTT CTAGGAGTT CT AG ATTTAGATTCTT C 
TTTAGTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTG 
TAGGTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTT 
GAAAAG 

SEQ ID NO. 6404
STRAIN H36B 

CrCTATTCTCAGATGAAACAAATGCTCTTGC 

CAACTTATCAAATG CTT CAG CTATGCTAAaTG CTATG CTT CCAAATT CTG 
TATTTACAGGCTTTTATTTATTTO 

TTC CAGGGTGGTGT ATCATGTGTG CAT ATTACTTTAGGAAAAGGTGTTTG 

TGGTGAATCTGCACAAACT G CTAAGACGCTGATCGTTGATGATGTTACAA 

AGCATGCTAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTA 

GTAC CTATGTTTAAAAATGG CAAACTTCTAGG AGTTCT 

TT CTTTAGTAGCAC4ATTATGATGAGATTGATCAAGAATACTTAGAAAAAT 

TTGTAGGTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGA 

GTTGAAAAG 

SEQ ID NO. 6405 
STRAIN 18RS21 

CTCTATTCTCAGATGAAACAAATGCTCTTGCC^^ 

ATCAAATGCTTCAGCTATG CTAAATGC!TATGCTTCCAAATTCTGTATTTA 
CAGG CTTTTATTTATTTGATGGAGAAGAGTTAATT CTTGG CC CTTTC CAG 
GGTGGTGTATCATGTGTGCATATTACTTTAG 

AT CTG CACAAACTGCTAAGACGCTGAT CGTTGATGATGTTACAAAG CATG 

CT AACTATAT CT C CTGTGATTCAA^GCTATG AGTGAAAT CGTAGTACCT 

ATGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATTTAGAT^ 

AGTAGCAGATTATGATGAGATTGATCAAGAATAC37TAGAAAAATTTGTAG 

GTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAA 

AAG 

SEQ ID NO. 6406 
STRAIN M732 

CTCTATTCTCAG ATG AAACAAATG CT CTTG CCAACTT 
ATCAAATGCTTCAGCTATG CTAAATG CTATGCTT C CAAATTCTGT ATTTA 
CAGGCTTTTATTTATTTGATGGAG AC<3AGTTAATT Cl^ CAG 
GGTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGTGA 
AT CTGCACAAACTG CTAAG ACG CTG ATTGTTGATGATGTTACAAAG CATG 
CT AACTATATCTCCTGTGATTCAAAAG CTATGAGTGAAAT CGTAGTACC C 
ATGTTTAAAAATGG CAAACTT CTAGGAGTT CTAGATTTAGATT 
AGTAG CAGATTATGATG AG ATTGAT CAAGAATACTTAGAAAAATTTGTAG 
GTATT CT AGTAGAACAT AC GATTTGGAATTTGGATATGTTTGGAGTTGAA 
AAG 

SEQ ID NO. 6407 
STRAIN COH1 

CT CTATT CT CAG ATG AAACAAATG CTCTTG C CAAC 

TTATCAAATGCTTCAGCTATGCTAAATGCTATGCTTCCAAATTCTGTATT 
TACAGGCTTTTATTTATTTGATGGAGAGGAGTTAATTCTTGGCC 
AGGGTGGTGT AT CAT GTGTG CATATT ACTTTAGGAAAAGGTGTTTGTGGT 
GAATCTGCACAAACTGCTAAGACGCTGATTGTTGATGATGTTACAAAGCA
TGCTAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTAC 
CCATGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATTTAGATTCTTCT 
TTAGTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTGT 
AGGTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTG 
AAAAG 



SEQ ID NO. 6408 
STRAIN M781

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAACTT
ATCAAATGCTTCAGCTATGCTAAATGCTATGCTTCCAAATTCTGTATTTA
CAGGCTTTTATTTATTTGATGGAGAGGAGTTAATTCTTGGCCCTTTTCAG
GGTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGTGA
ATCTGCACAAACTGCTAAGACGCTGATTGTTGATGATGTTACAAAGCATG
CTAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTACCC
ATGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATTTAGATTCTTCTTT
AGTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTGTAG
GTATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAA
AAG



SEQ ID NO. 6409 
STRAIN CJB110 

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAACTTA 
TCAAATGCTTCAG CTATG CT AAATG CTATG CTTC CAAATT CTGT ATTTAC 
AGGCTTTTATTTATTTGATGGAAAGGAGTTAATTCT^ 
GTGGTGTAT CATGTGTG CATATTACTTTAGGAAAAGGTGTTTGTGGTGAA 
TC!TG CACAAACTGCTAAGACGCTGATTGTTGATGATGTTACAAAG CATG C 
TAACTATAT CTCCTGTGATTCAAAAG CTATGAGTGAAAT CGTAGT ACCT A 
TGTTT AAAAATGGCAAACTTCTAGGAGTT CTAGATTTAGATT CTT CTTT A 
GT AGCAGATT ATGATGAGATTGAT CAAGAATACTTAGAAAAATTTGTAGG 
TATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAAA 

AG 



SEQ ID NO. 6410 
STRAIN 1169NT 

CTCTATTCTCAGATGAAACAAATGCTCTTGCCAACTTA
TCAAATGCTTCAGCTATGCTAAATGCTATGCTTCCAAATTCTGTATTTAC
AGGCTTTTATTTATTTGATGGAGAAGAGTTAATTCTTGGCCCTTTCCAGG
GTGGTGTATCATGTGTGCATATTACTTTAGGAAAAGGTGTTTGTGGTGAA
TCTGCACAAACTGCTAAGACGCTGATTGTTGATGATGTTACAAAGCATGC
TAACTATATCTCCTGTGATTCAAAAGCTATGAGTGAAATCGTAGTACCCA
TGTTTAAAAATGGCAAACTTCTAGGAGTTCTAGATTTAGATTCTTCTTTA
GTAGCAGATTATGATGAGATTGATCAAGAATACTTAGAAAAATTTGTAGG
TATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAGTTGAAA
AG



SEQ ID NO. 6411
STRAIN JM9130013

CTCTATTCTCAGATGA&ACAAATGCTCTTGCCA 

TCAAATG CTT CAG CTATG CTAAATG CTATG CITC CAAATTCTGT ATTTAC 
AGGCTTTTATTTATTTGATGGAGAAGAGTTAATT CTTGG CCCTTT CCAGG 
GTGGTGTAT CATGTGTG CATATTACTTTAGGAAAAGGTGTTTGTGGTGAA 
TCTG CACAAACTG CTAAGACG CTGATCGTTGATGATGTTACAAAGCATGC 
TAACTATAT CT C CTGTGATT CAAAAGCTATGAGTGAAAT CGTAGTACCTA 
TGTTTAAAAATGGCAAACITCTAGGAGTTCT 

GTAG CAGATTATGATGAGATTGAT CAAGAATACTTAGAAAAATTTGTAGG 

TATTCTAGTAGAACATACGATTTGGAATTTGGATATGTTTGGAG 

AG 

PRETTY of: /biotmp/msa236796.2{*} February 11, 2003 02:42



50 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus 



atgaacaagt ctaagaaaat cgaaaattat caattattat tactacaagc 



********** ********** ********** ********** ********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}



51 

CTCTA 

CTCTA 

CTCTA 

CTCTA 

CTCTA 

CTCTA 

gcaagCTCTA 



TTCTCAGATG 
TTCTCAGATG 
TTCTCAGATG 
TTCTCAGATG 
TTCTCAGATG 
TTCTCAGATG 
TTCTCAGATG 



AAACAAATGC 
AAACAAATGC 
AAACAAATGC 
AAACAAATGC 
AAACAAATGC 
AAACAAATGC 
AAACAAATGC 



TCTTGCCAAC 
TCTTGCCAAC 
TCTTGCCAAC 
TCTTGCCAAC 
TCTTGCCAAC 
TCTTGCCAAC 
TCTTGCCAAC 



100 

TTATCAAATG
TTATCAAATG 
TTATCAAATG 
TTATCAAATG 
TTATCAAATG 
TTATCAAATG 
TTATCAAATG 
msa236796.2{282_A909} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG

msa236796.2{282_H36B} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG

msa236796.2{282_JM9130013} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG

msa236796.2{282_1169NT} CTCTA TTCTCAGATG AAACAAATGC TCTTGCCAAC TTATCAAATG

Consensus ********** ********** ********** ********** ********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus 



101 

CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 
CTTCAGCTAT 



GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 
GCTAAATGCT 



ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 
ATGCTTCCAA 



ATTCTGTATT 
ATTCTGTATT
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 
ATTCTGTATT 



150 

TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 
TACAGGCTTT 



********** ********** ********** ********** ********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus



151 

TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 
TATTTATTTG 



ATGGAgAgGA 
ATGGAgAgGA 
ATGGAgAgGA 
ATGGAaAgGA 
ATGGAaAgGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 
ATGGAgAaGA 



GTTAATTCTT 
GTTAATTCTT
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 
GTTAATTCTT 



GGCCCTTTtC 
GGCCCTTTtC 
GGCCCTTTtC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 
GGCCCTTTcC 



200 

AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 
AGGGTGGTGT 



********** *****_*_** ********** ********_* ********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus



201 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 

ATCATGTGTG 
********** 



CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 

CATATTACTT 
********** 



TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 

TAGGAAAAGG 
********** 



TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 
TGTTTGTGGT 



250 

GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 
GAATCTGCAC 



********** ********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus



251 

AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 
AAACTGCTAA 

AAACTGCTAA 
********** 



GACGCTGATt 

GACGCTGATt 

GACGCTGATt 

GACGCTGATt 

GACGCTGATt 

GACGCTGATc

GACGCTGATc 

GACGCTGATc

GACGCTGATc 

GACGCTGATc 

GACGCTGATt 
*********_ 



GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 

GTTGATGATG 
********** 



TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
TTACAAAGCA 
********** 



300 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 

TGCTAACTAT 
********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus



301 

ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 
ATCTCCTGTG 



ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 
ATTCAAAAGC 



TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 
TATGAGTGAA 



ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 
ATCGTAGTAC 



********** ********** ********** ********** 



350 

CcATGTTTAA 

CcATGTTTAA 

CcATGTTTAA 

CtATGTTTAA 

CtATGTTTAA 

CtATGTTTAA 

CtATGTTTAA 

CtATGTTTAA 

CtATGTTTAA 

CtATGTTTAA 

CcATGTTTAA 
*_******** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}



351 

AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 
AAATGGCAAA 



CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 



TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 



AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 



400 

TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 






msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus 



AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 

AAATGGCAAA 
********** 



CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 
CTTCTAGGAG 



TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 
TTCTAGATTT 



AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 
AGATTCTTCT 



TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 
TTAGTAGCAG 



********** ********** ********** ********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus 



401 

ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 
ATTATGATGA 



GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 
GATTGATCAA 



GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 
GAATACTTAG 



AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 
AAAAATTTGT 



********** ********** ********** ********** 



450 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 

AGGTATTCTA 
********** 



msa236796.2{282_COH1}
msa236796.2{282_M732}
msa236796.2{282_M781}
msa236796.2{282_090}
msa236796.2{282_CJB110}
msa236796.2{282_18RS21}
msa236796.2{282_2603}
msa236796.2{282_A909}
msa236796.2{282_H36B}
msa236796.2{282_JM9130013}
msa236796.2{282_1169NT}

Consensus



451 

GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 
GTAGAACATA 



CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 
CGATTTGGAA 



TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATA--
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 
TTTGGATAtg 



495 

tttggagttg aaaag 
tttggagttg aaaag 
tttggagttg aaaag 



********** ********** ********, 



tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 
tttggagttg 



aaaag 
aaaag 
aaaag 
aaaag 
aaaag 
aaaag 
aaaag 



SEQ ID NO. 6412
STRAIN 2603 frame: 1

mnkskkienyqllllqaqalfsdetnalanlsnasamlnamlpnsvftgfylfdgeelil
gpfqggvscvhitlgkgvcgesaqtaktlivddvtkhanyiscdskamseivvpmfkngk
llgvldldsslvadydeidqeylekfvgilvehtiwnldmfgvek

SEQ ID NO. 6413 

STRAIN 090 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGKELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLD 

SEQ ID NO. 6414

STRAIN A909 frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6415
STRAIN H36B frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6416
STRAIN 18RS21 frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6417 
STRAIN M732 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6418
STRAIN COH1 frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6419
STRAIN M781 frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6420 

STRAIN M781 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6421 

STRAIN CJB110 frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTG

GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6422 
STRAIN 1169NT frame: 3

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVV

QEYLEKFVGILVEHTIWNLDMFGVEK

SEQ ID NO. 6423 
STRAIN JM9130013 frame: 3 

LFSDETNALANLSNASAMLNAMLPNSVFTGFYLFDGEELILGPFQGGVSCVHITLGKGVC
GESAQTAKTLIVDDVTKHANYISCDSKAMSEIVVPMFKNGKLLGVLDLDSSLVADYDEID
QEYLEKFVGILVEHTIWNLDMFGVEK

PRETTY of: /biotmp/msa237960.2{*} February 11, 2003 02:46



msa237960.2{282_1169NT}
msa237960.2{282_18RS21}
msa237960.2{282_2603}
msa237960.2{282_A909}
msa237960.2{282_COH1}
msa237960.2{282_H36B}
msa237960.2{282_JM9130013}
msa237960.2{282_M732}
msa237960.2{282_M781}
msa237960.2{282_090}
msa237960.2{282_CJB110}

Consensus



L FSDETNALAN 

L FSDETNALAN

mnkskkieny qllllqaqaL FSDETNALAN 

L FSDETNALAN 

L FSDETNALAN 

L FSDETNALAN 

L FSDETNALAN 

L FSDETNALAN

L FSDETNALAN 

L FSDETNALAN 

L FSDETNALAN 



50 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 

MLPNSVFTGF 
********** 



msa237960.2{282_1169NT}
msa237960.2{282_18RS21}
msa237960.2{282_2603}
msa237960.2{282_A909}
msa237960.2{282_COH1}
msa237960.2{282_H36B}
msa237960.2{282_JM9130013}
msa237960.2{282_M732}
msa237960.2{282_M781}
msa237960.2{282_090}
msa237960.2{282_CJB110}

Consensus



51 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGeELIL 

YLFDGkELIL 

YLFDGkELIL 
*****_**** 



GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 

GPFQGGVSCV 
********** 



HITLGKGVCG 

HITLGKGVCG

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 

HITLGKGVCG 
********** 



ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 

ESAQTAKTLI 
********** 



100 

VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
VDDVTKHANY 
********** 



msa237960.2{282_1169NT}
msa237960.2{282_18RS21}
msa237960.2{282_2603}
msa237960.2{282_A909}
msa237960.2{282_COH1}
msa237960.2{282_H36B}
msa237960.2{282_JM9130013}
msa237960.2{282_M732}
msa237960.2{282_M781}
msa237960.2{282_090}
msa237960.2{282_CJB110}

Consensus



101 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 

ISCDSKAMSE 
********** 



IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 

IWPMFKNGK 
********** 



LLGVLDLDSS 
LLGVLDLDS S 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 
LLGVLDLDSS 



LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 
LVADYDEIDQ 



********** ********** 



150 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 

EYLEKFVGIL 
********** 



msa237960 .2{ 
msa237960 .2( 

msa237960 . 

msa237960 . 

msa237960 . 

msa237960 . 
msa237960.2{282 

msa237960 . 

msa237960 . 
msa237960 
msa237960 .2{ 



282_1169NT} 
282_18RS2l} 
2(282_2603} 
2{282_A909} 
2(282_COHl" 
2{282_H36B 
_JM9130013 
2(282_M732 
2{282_M781 
.2{282_090 
282_CJB110 
Consensus 



151 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLDm 

VEHTIWNLD- 

VEHTIWNLDm 
*********_ 



165 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 
fgvek 

fgvek 
Table 65: Comparative Sequences relating to SAG0231



SEQ ID NO. 6501 
STRAIN 2603 

ATGAAAAAGAGTACCCAAATAATACTACTAATAGTTGCA 

TTATTCATACTTGTTTTTAGCGGAGGATTTTATATGAAAGAACAACAAAGAAAAGAAGAA
CTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTCAAAGCATTGAAAAATTCCTATGAG
AATATAGAAGAAATAAAAATCACACATCCTGTTTCAACTGAAATTCCTGGAGATTGGCAT
TGTACTGTAAAGATTTCATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAAT
TTGGAATCGAAAAAAAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTTTTTGAT
TCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGGTCAGGAGAAG
ATACAA 

SEQ ID NO. 6502 
STRAIN 090 

GGAGGATTTTATATGAAAGAACA 

ACAAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAG 
TCAAAGCATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACA 
CATCCTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACTGTAAAGAT
TTCATTTAATGATAAAAAATCTATTGTTTATAATAT^ 
AATCGAAAAAAAATTATAGCGGAAATTTTAATGAAAAAAATATGAATTTT 
TTTGATTCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTC
AGAtGGtCAGGAGAAGATaCAA

SEQ ID NO. 6503 
STRAIN A909 

GGAGGATTTTATATGAAAGAACAACAA 

AGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTCAA
AGCATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACACATC
CTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACT CA
TTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAATC
GAAAAAAAATTATAGCGGAAAATTTAATGAAAAAAATATGAAT^ 
ATTCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTAaTT'TlT'CAGAT 
GGtCAGGAGAAGATACAA

SEQ ID NO. 6504 
STRAIN H36B 

GGAGGATTTTATATGAAAGAACA 

ACAAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAG 
TCAAAGCATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACA
CATCCTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACT
TTCATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAAT^ 
AATCGAAAAAAAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTT
TTTGATTCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTAtTTTTTC
AGATGGtCAGGAGAAGATaCAA

SEQ ID NO. 6505
STRAIN 18RS21

GGAGGATTTTATATGAAAGAACAAC 

AAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTC 
AAAGCATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACACA
TCCTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACTGTAAA
CATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAA 
TCGAAAAAAAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTTTT 
TG ATT CAAG AATTGGTAAAACAAAAAAAACTATAAAAATTATTTT^ CAG 
ATGGtCAGGAGAAGATaCAA

SEQ ID NO. 6506 
STRAIN M781 

GGAGGATTTTATATGAAAGAACAACAAAGAAAA 

GAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTCAAAGCATT 
GAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACACATCCTGTTT
CAACTGAAATTCCTGGAGATTGGCATTGTACTGTAAAGATTTCATTTAAT
GATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAATCGAAAAA 
AAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTTTTTGATTCAA
GAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGGTCAG 
GAGAAGATACAA 

SEQ ID NO. 6507 
STRAIN CJB110 

GGAGGATTTTATATGAAAGAACAACAAAGAAAAGAAGAA 

CTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTCAAAGCATTGAAAAA

TTCCTATGAGAATATAGAAGAAATAAAAATCACACATCCTGTTTCAACTG

AAATTCCTGGAGATTGGCATTGTACTGTAAAGATTTCATTT

AAATCTATTGTTTATAATATTACACATAATTTGGAATCGAAAAAAAATTA

TAGCGGAAATTTTAATGAAAAAAATATGAATTTTTTTGATTCAA 

GTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGGTCA 

ATACAA 

SEQ ID NO. 6508 
STRAIN 1169NT

GGAGGATTTTATATGAAAGAACAACAAAG 

AAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTCAAAG 
CATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACACATCCT 
GTTTCAACTGAAATTCCTGGAGATTGGCATTGTACTGTAAAGATTTCATT
TAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAATCGA
AAAAAAATTATAGTGGAAAATTTAATGAAAAAAATATGAATTTTTTTGAT
TCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAGATGG 
TCAGGAGAAGATACAA 

SEQ ID NO. 6509
STRAIN JM9130013

GGAGGATTTTATATGAAAGAACAAC 

AAAGAAAAGAAGAACTAAAACGGAATCGAGAATATGAAGTTAGTCTAGTC 
AAAGCATTGAAAAATTCCTATGAGAATATAGAAGAAATAAAAATCACACA 
TCCTGTTTCAACTGAAATTCCTGGAGATTGGCATTGTACTGTAAAGATTT
CATTTAATGATAAAAAATCTATTGTTTATAATATTACACATAATTTGGAA 
TCGAAAAAAAATTATAGCGGAAAATTTAATGAAAAAAATATGAATTTTTT
TGATTCAAGAATTGGTAAAACAAAAAAAACTATAAAAATTATTTTTTCAG
AtGGtCAGGAGAAGATACAA

PRETTY of: /biotmp/msa75400.2{*} March 10, 2003 09:56

                           1                                                   50
msa75400.2{286_090}
msa75400.2{286_CJB110}
msa75400.2{286_18RS21}
msa75400.2{286_2603}       atgaaaaaga gtacccaaat aatactacta atagttgcat tattcatact
msa75400.2{286_A909}
msa75400.2{286_H36B}
msa75400.2{286_JM9130013}
msa75400.2{286_M781}
msa75400.2{286_1169NT}
Consensus                  ********** ********** ********** ********** **********

                           51                                                 100
msa75400.2{286_090}                   GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_CJB110}                GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_18RS21}                GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_2603}       tgtttttagc GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_A909}                  GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_H36B}                  GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_JM9130013}             GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_M781}                  GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
msa75400.2{286_1169NT}                GGAGGATTTT ATATGAAAGA ACAACAAAGA AAAGAAGAAC
Consensus                  ********** ********** ********** ********** **********

                           101                                                150
msa75400.2{286_090}        TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_CJB110}     TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_18RS21}     TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_2603}       TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_A909}       TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_H36B}       TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_JM9130013}  TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_M781}       TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
msa75400.2{286_1169NT}     TAAAACGGAA TCGAGAATAT GAAGTTAGTC TAGTCAAAGC ATTGAAAAAT
Consensus                  ********** ********** ********** ********** **********

                           151                                                200
msa75400.2{286_090}        TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_CJB110}     TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_18RS21}     TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_2603}       TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_A909}       TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_H36B}       TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_JM9130013}  TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_M781}       TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
msa75400.2{286_1169NT}     TCCTATGAGA ATATAGAAGA AATAAAAATC ACACATCCTG TTTCAACTGA
Consensus                  ********** ********** ********** ********** **********

                           201                                                250
msa75400.2{286_090}        AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_CJB110}     AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_18RS21}     AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_2603}       AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_A909}       AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_H36B}       AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_JM9130013}  AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_M781}       AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
msa75400.2{286_1169NT}     AATTCCTGGA GATTGGCATT GTACTGTAAA GATTTCATTT AATGATAAAA
Consensus                  ********** ********** ********** ********** **********

                           251                                                300
msa75400.2{286_090}        AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_CJB110}     AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_18RS21}     AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_2603}       AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_A909}       AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_H36B}       AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_JM9130013}  AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_M781}       AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
msa75400.2{286_1169NT}     AATCTATTGT TTATAATATT ACACATAATT TGGAATCGAA AAAAAATTAT
Consensus                  ********** ********** ********** ********** **********

                           301                                                350
msa75400.2{286_090}        AGcGGAAAtT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_CJB110}     AGcGGAAAtT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_18RS21}     AGcGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_2603}       AGcGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_A909}       AGcGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_H36B}       AGcGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_JM9130013}  AGcGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_M781}       AGcGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
msa75400.2{286_1169NT}     AGtGGAAAaT TTAATGAAAA AAATATGAAT TTTTTTGATT CAAGAATTGG
Consensus                  **_*****_* ********** ********** ********** **********

                           351                                                400
msa75400.2{286_090}        TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_CJB110}     TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_18RS21}     TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_2603}       TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_A909}       TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_H36B}       TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_JM9130013}  TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_M781}       TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
msa75400.2{286_1169NT}     TAAAACAAAA AAAACTATAA AAATTATTTT TTCAGATGGT CAGGAGAAGA
Consensus                  ********** ********** ********** ********** **********

                           401
msa75400.2{286_090}        TACAA
msa75400.2{286_CJB110}     TACAA
msa75400.2{286_18RS21}     TACAA
msa75400.2{286_2603}       TACAA
msa75400.2{286_A909}       TACAA
msa75400.2{286_H36B}       TACAA
msa75400.2{286_JM9130013}  TACAA
msa75400.2{286_M781}       TACAA
msa75400.2{286_1169NT}     TACAA
Consensus                  *****



SEQ ID NO. 6510 
STRAIN 2603 frame: 1 

MKKSTQIILLIVALFILVFSGGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKI

THPVSTEIPGDWHCTVKISFNDKKSIVYNITHNL^ 

KTIKIIFSDGQEKIQ

SEQ ID NO. 6511
STRAIN 090

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGD

WHCTVKISFNDKKSIVYNITHNLESKKNYSGNFNEKNMNFFDSRIGKTKKTIKIIFSDGQ

EKIQ 

SEQ ID NO. 6512 
STRAIN A909 

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGDWH
CTVKISFNDKKSIVYNITHNLESKKNYS

IQ 

SEQ ID NO. 6513
STRAIN H36B

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGD

WHCTVKISFNDKKSIVYNITHNLESKKNYSGKFNEKNMNFFDSRIGKTKKTIKIIFSDGQ

EKIQ 

SEQ ID NO. 6514 
STRAIN 18RS21 

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGDW
HCTVKISFNDKKSIVYNITHNLESKKNYSGKFNEKNMNFFDSRIGKTKKTIKIIFSDGQE
KIQ 

SEQ ID NO. 6515 
STRAIN CJB110

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGDWHCTVK
ISFNDKKSIVYNITHNLESKKNYSGNFNEKNMNFFDSRIGKTKKTIKIIFSDGQEKIQ

SEQ ID NO. 6516 
STRAIN JM9130013 

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGDW



HCIVKISFNDKKSIVYNITHNLESKKNYSGKFNEKN^ 
KIQ 

SEQ ID NO. 6517 
STRAIN 1169NT frame: 1

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGDWHCTVKISF
NDKKSIVYNITHNLESKKNYSGKFNEKNMNFFDSRIGKTKKTIKIIFSDGQEKIQ

SEQ ID NO. 6518
STRAIN M781 frame: 1

GGFYMKEQQRKEELKRNREYEVSLVKALKNSYENIEEIKITHPVSTEIPGDWHCTVKISF
NDKKSIVYNITHNLESKKNYSGKFNEKNMNFFDSRIGKTKKTIKIIFSDGQEKIQ

PRETTY of: /biotmp/msa75376.2{*} March 10, 2003 10:01

                           1                                                   50
msa75376.2{286_090}                              GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_1169NT}                           GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_18RS21}                           GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_2603}       mkkstqiill ivalfilvfs GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_A909}                             GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_CJB110}                           GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_H36B}                             GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_JM9130013}                        GGFYMKEQQR KEELKRNREY EVSLVKALKN
msa75376.2{286_M781}                             GGFYMKEQQR KEELKRNREY EVSLVKALKN
Consensus                  ********** ********** ********** ********** **********

                           51                                                 100
msa75376.2{286_090}        SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_1169NT}     SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_18RS21}     SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_2603}       SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_A909}       SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_CJB110}     SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_H36B}       SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_JM9130013}  SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
msa75376.2{286_M781}       SYENIEEIKI THPVSTEIPG DWHCTVKISF NDKKSIVYNI THNLESKKNY
Consensus                  ********** ********** ********** ********** **********

                           101                                135
msa75376.2{286_090}        SGnFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_1169NT}     SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_18RS21}     SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_2603}       SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_A909}       SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_CJB110}     SGnFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_H36B}       SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_JM9130013}  SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
msa75376.2{286_M781}       SGkFNEKNMN FFDSRIGKTK KTIKIIFSDG QEKIQ
Consensus                  **_******* ********** ********** *****
Table 66: Comparative Sequences relating to SAG0754



SEQ ID NO. 6601 
STRAIN 2603 

TTGACAAGGCATATAAAAATTTCTATACTAAATTTACAAAATGAAGGAGAGGGAACTATG
GAAATACTGATTGCAGGTGGTAGTGGTTTTTTAGGAAAGCAGATAATAAAAGCAGCGCTT
AGAAAAGGGCATAAAGTGGCTTACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAG
GATCCTAGATTAACCTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAA
GACAGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAATCAACTAGAT
GAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGTCACAAAAATCAAATACCA
AAGTTAGTTTATATTTCAGCCAACAGCGGCTATTCAGCTTACATTAAAAGTAAAAGGAAG
GCAGAGCAGATAATCAAAGCAAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATG
TATGGTGAAGAGCGACCTCTCTCGATTTTCCMGCa^GTGTATAAAGTTATTTAGTCAT 
TTGCCTTTCrrTAGGTATTGTTGTACAAAAGGTCTTTCCAACTAA 

GAAGCAATCGTTACTACGCTTAGGAAAAAACC^CCCAAAAAATCCTTTCTATTGAAGAA 
TTAAATAATAAA 

SEQ ID NO. 6602 
STRAIN 090 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAAT 
GAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTT 
AGGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTT
ACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTA 
ACCTACATTAGGGGAGATATTACAGAAGCTaATAAGATTCATTTAGAAGA 
CAGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAATC
AACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGT
CACAAAAATCAAATACCAAAGTTAGTTTATATTTCAGCCAACAGCGGCT 
TTCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAAAGCAA
GCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAGAG
CGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAGTTATTTAGTCATTT
GCCTTTCTTAGGTATTGTTGTACAAAAGGTCTTTCCAACTAAGGTTGTGA
TAGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAAAA 
ATCCTTTCTATTGAAGAATTAAATAATAAA

SEQ ID NO. 6603 
STRAIN A909 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAATG 

AAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTTA

GGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTTA 

CTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTAA 

CCTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAAGAC 

AGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAATCA

ACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGTC

ACAAAAATCAAATACCAAAGTTAGTTTATATTTCAGCCAACA 

TCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAAAGCAAG

CGGTCIXX^TTATCT'TITT'GTAAGACCAGGTTTGATGTATGGTGAAGAGC 

GACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAGTTATTTAGTCATTTG

C CTTT CTTAGGTATTGTTGT ACAAAAGGT CTTTC GAA 

AGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAAAAA 

TCCTTTCTATTGAAGAATTAAATAATAAA

SEQ ID NO. 6604 
STRAIN H36B 

TATAAAAATTTCTATACTAAATTTACAAAATGAAGGAGAGGGAACTATGG

AAATACTGATTGCAGGTGGTAGTGGTTTTTTAGGAAAGCAGATAATAAAA

GCAGCGCTTACAAAAGGGCATAAAGTGGCTTACTTATCAAGACATGAAGG

TAAAGGTGATATATTTAAGGATCCTAGATTAACCTACATTAGGGGAGATA

TTACAGAAGCTGATAAGATTCATTTAGAAGACAGAACTTTTGATATATTA

ATTGACTGTATTGGAGCGATTAAGCCCAATCAACTAGATGAGCTTAACGT

TAAAGCAACCCAAAAAGCAGTAGCACTCTGTCACAAAAATG^^ 

AGTTAGTTTATATTTCAGCCAACAGCGGCTATTCAGCTT 

AAAAGGAAGGCAGAGCAGATAATCAAAGCAAGCGGTCTGGATTATCTTTT 

TGTAAGACCAGGTTTGATGTATGGTGAAGAGCGACCTCTCTCG 

AAGCCAAGTGTATAAAGTTATTTAGTCATTTGCCTTTCTTAGGTATTGTT

GTACAAAAGGTCTTTCCAACTAAGGTTGTGATAGTGGCAGAAGCAATCGT

TACTACGCTTAGGAAAAAACCAACCCAAAAAATCCTTTCTATTGAAGAAT

TAAATAATAAA 

SEQ ID NO. 6605 
STRAIN 18RS21 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAAT
GAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTT
AGGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTT
ACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTA
ACCTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAAGA
CAGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAATC
AACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGT
CACAAAAATCAAATACCAAAGTTAGTTTATATTTCAGCCAACAGCGGCTA
TTCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAAAGCAA
GCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAGAG
CG AC CTCT CT CG ATTTTCCAAG CCAAGTGTATAAAGTT^ 
G C CTTT'C"1"1AGGTATTGTTGTACAAAAGGTCTTT C CAACTAAGGTTGTGA 
TAGTGGCAGAAG CAATCGTT ACTACG CTT'AGC AAAAAACCAACC CAAAAA 
ATCCTTTCTATTGAAGAATTAAATaATAAA 



SEQ ID NO. 6606
STRAIN M732

CAAAATGAAGGAGAgGGAACTATGgAAATACTGATTGCAGGTGGTAGTGG
TTTTCTAGGGAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAGG
TGGCTTACTTATCAAGGCATGAAGGTAAAGGTGATATATTTAAGGATCcT
AGATTAACCTACATTAAGGGAGATATTACAGAAGCTGATAAGATTCATTT
AGaACATAGAAATTTTGATATATTAATTGACTGTATTGGAGCGATTAAGC
CCAATCAACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCA
CTCTGTCACAAAAATCAAATACCAAAGTTAGTTTACATTTCAGCCAATAG
CGGCTATTCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCA
AAGCAAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGT
GAAGAGCGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAATTATTTAG
TCATTTGCCTTTCITAGGTATTGTTGTACAAAAAGTCT^ 
TTGTGATAGTGGCAGAAGCAATCGTTACTTCGCTTAGGAAAAAACCAACT 
CAAAAAATCCTTTCTATTGAAGAATTAAATAATAAA

SEQ ID NO. 6607
STRAIN COH1

ACAAGGCATATAAAAATTTCTATACTAAATTTAC

AAAATGAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGT
TTTCTAGGGAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAGGT
GGCTTACTTATCAAGGCATGAAGGTAAAGGTGATATATTTAAGGATCCTA
GATTAACCTACATTAAGGGAGATATTACAGAAGCTGATAAGATTCATTTA
GAACATAGAAATTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCC 
CAATCAACTAGATGAGCIT'AACGTTAAAGCAACCCAAAAAGCAGTAGC^ 
TCTGTCACAAAAATCAAATACCAAAGTTAGTTTACATTTCAGCC 
GGCTATTCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAA
AGCAAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTG
AAGAGCGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAATTATTTAGT
CATTTGCCTTT CTTAGGTATTGTTGTACA?U^AAGTCT 
TGTGATAGTGGCAGAAGCAATCGTTACnTTCGCTTAGGAAAAAAC CAACT C 
AAAAAATCCTTTCTATTGAAGAATTAAATAATAAA

SEQ ID NO. 6608 
STRAIN M781 

ACAAGGCATATAAAAATTTCTATACTAAATTTaCA

AAATGAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTT 

TTCTAGGGAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAGGTG

GCTTACTTATCAAGGCATGAAGGTAAAGGTGATATATTTAAGGATCCTAG

ATTAACCTACATTAAGGGAGATATTACAGAAGCTGATAAGATTCATTTAG 

AACATAGAAATTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCC 

AATCAACTAGATGAGCITAACGTTAAAGCAACCCA 

CTGTCACAAAAATCAAATACCAAAGTTAGTTTACATTTCAG 

GCTATTC^GCTTAC^TTAAAAGTAAAAGGAAGGa^GAGCAGATAATCAAA 

GCAAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGA 

AGAGCGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAATTATTTAGTC 

ATTTGCCTTTCTTAGGTATT?GTTGTACAA 

GTGATAGTGGC^GAAGCAATCGTTACTTCGCTTAGGAAAAAACC^CTCA 
AAAAATCCTTTCTAtTGAAGAATTAAATAATAAA

SEQ ID NO. 6609 
STRAIN 1169NT 

ACAAGGCATATAAAAATTTCTATA(^AAATTTACAA^ 
ATGAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTT 
TTAGGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAGTTGGC
TTACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGAT
TAACCTACATTAAGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAA
GACAGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAA
TCAACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCA
GTCACAAAAATCAAATACCAAAGTTAGTTTACATTTCAGCCAACAGCGGC
TATTCAGCTTACATTAGAAGTAAAAGGAAGGCAGAGCAGATAATCAAAGC
AAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAG
AGCGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAATTATTTAGTCAT
TTGCCTTTCTTAGGTATTGTTGTACAAAAGGTCTTTCCAACT
GATAGTGGCAGAAGCAATCGTTACTACGCTTAGGACAAAACCAACTCAAA
AAATCCTTTCTATTGAAGAATTAAATAATAAA

SEQ ID NO. 6610 
STRAIN CJB110 

ACAAGG CATATAAAAATTT CTATACT A^TTTACAAA 
ATGAAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTT 
TTAGGAAAGCAGATAATAAAAGCAG C^CTTACAAAAGGGCATAAAGTGGC 
TTACTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGAT 
TAACCTACATTAGGGC^GATATTACAGAAGCT 

GACAGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAA 
TCAACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCT
GTCACAAAAATCAAATACCAAAGTTAGTTTATATTT C 
TATTC7\GCTTA(^TTAAAAGTAAAA.GGAAGGCAGAGCAGATAATCAAAGC 
AAGCGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAG
AGCGACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAGTTATTTAGTCAT
TTGCCTTTCTTAGGTATTGTTGTACAAAAG^ 

C^TAGTGGC^GAAGCMTCGTTACTACGCTTAGGAAAAAACCAACCCAAA 
AAATCCTTTCTATTGAAGAATTAAATAATAAA

SEQ ID NO. 6611 
STRAIN JM9130013 

ACAAGGCATATAAAAATTTCTATACTAAATTTACAAAATG 
AAGGAGAGGGAACTATGGAAATACTGATTGCAGGTGGTAGTGGTTTTTTA 
GGAAAGCAGATAATAAAAGCAGCGCTTACAAAAGGGCATAAAGTGGCTTA
CTTATCAAGACATGAAGGTAAAGGTGATATATTTAAGGATCCTAGATTAA
CcTACATTAGGGGAGATATTACAGAAGCTGATAAGATTCATTTAGAAGAC
AGAACTTTTGATATATTAATTGACTGTATTGGAGCGATTAAGCCCAATCA
ACTAGATGAGCTTAACGTTAAAGCAACCCAAAAAGCAGTAGCACTCTGTC
ACAAAAATCAAATACCAAAGTTAGTTTATATTTCAGCCAACAGCGGCTAT
TCAGCTTACATTAAAAGTAAAAGGAAGGCAGAGCAGATAATCAAAGCAAG
CGGTCTGGATTATCTTTTTGTAAGACCAGGTTTGATGTATGGTGAAGAGC 
GACCTCTCTCGATTTTCCAAGCCAAGTGTATAAAGTTATTTAGTCATTTG 
CCtTTCTTAgGTATTGTTGTACAAAAGGTCTTTCCAACTAAGGTTGTGAT 
AGTGGCAGAAGCAATCGTTACTACGCTTAGGAAAAAACCAACCCAAAAAA
TCCTTTCTATTGAAGAATTAAATAATAAA 

PRETTY of: /biotmp/msa137119.2{*} April 10, 2003 03:30



{3 03_COH1} 
{303_M732} 



msal37119 .2 
msal37119.2 
rasal37119.2{303_m78l} 
msal37119 .2{303_090} 
msal37119.2{303_18RS21^ 
msal37119 . 2 { 303_2603 
msal37119.2{303_A909 
msal37119.2{303_CJB110 
msal37119.2{303_H36B! 
msal37119.2{303_JM9130013} 
msal37119 . 2 {303_1169NT} 

Consensus 



acaaggc atataaaaat ttctatacta 



acaaggc 

acaaggc 

acaaggc 

ttgacaaggc 

acaaggc 

acaaggc 

acaaggc 

acaaggc 

*** — _ — — 



atataaaaat 
atataaaaat 
atataaaaat 
atataaaaat 
atataaaaat 
atataaaaat 
-tataaaaat 
atataaaaat 
atataaaaat 



ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 
ttctatacta 



aatttaCAAA 

CAAA 

aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 
aatttaCAAA 

aatttaCAAA 
**** 
ATGAAGGAGA

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 

ATGAAGGAGA 
********** 



msal37119 . 

msal37119. 

msal37119. 
msa!37119 
msal37119.2{ 

msal37119 . 

msal37119. 
msal37119.2{ 

msal37119 . 
msal37119.2{303 
msal37119.2{ 



2{303_COHl} 
2<303_M732} 
2{303_m78l} 
.2{303_090} 
303_18RS21) 
2{303_2603} 
2{303_A909J 
303_CJB110~ 
2{303_H36B' 
_JM9130013' 
303_1169NT] 
Consensus 



51 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 

GGGAACTATG 
********** 



GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 

GAAATACTGA 
********** 



TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 

TTGCAGGTGG 
********** 



TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 
TAGTGGTTTT 

TAGTGGTTTT 
********** 



100 

cTAGGgAAGC 

cTAGGgAAGC 

cTAGGgAAGC 

tTAGGaAAGC 

tTAGGaAAGC 

tTAGGaAAGC 

tTAGGaAAGC 

tTAGGaAAGC 

tTAGGaAAGC 

tTAGGaAAGC 

tTAGGaAAGC 
_****_**** 



msal37119 . 

msal37119. 

msal37119 . 
msal37119 
msal37119.2{ 

msal37119. 

msal37119 . 
msal37119.2{ 

msal37119 . 
msal37119.2{303 
msa!37119.2{ 



2{303_COH1] 
2{303_M732] 
2{303_m78l] 
.2{303_090] 
303_18RS2i; 
2{303_2603 
2{303__A909 
303_C0B110 
2{303__H36B 
_JM9130013} 
303__1169NT} 
Consensus 



101 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 

AGATAATAAA 
********** 



AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 

AGCAGCGCTT 
********** 



ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 

ACAAAAGGGC 
********** 



ATAAggTGGC 

ATAAggTGGC 

ATAAggTGGC 

ATAAagTGGC 

ATAAagTGGC 

ATAAagTGGC 

ATAAagTGGC 

ATAAagTGGC 

ATAAagTGGC 

ATAAagTGGC 

ATAAgtTGGC 
****_„**** 



150 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 

TTACTTATCA 
********** 



msal37119 . 

msal37119. 

msal37119 . 
msal37119 
msal37119.2{ 

msal37119 . 

msal37119 . 
msal37119.2{ 

msal37119 . 
msal37119.2{303 
msal37119.2{ 



2{303_C0H1 
2{303_M732 
2{303_m781 
.2{303_090 
303_18RS21 
2f303_2603 
2{303_A909 
3 03_CJB110 
2{303_H36B 
_JM9130013 
303_1169NT 
Consensus 



151 

AGgCATGAAG 
AGgCATGAAG 
AGgCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
AGaCATGAAG 
**^ ******* 



GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
GTAAAGGTGA 
********** 



TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 

TATATTTAAG 
********** 



GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 

GATCCTAGAT 
********** 



200 

TAA CCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 

TAACCTACAT 
********** 



msal3 71 1 9 . 2 { 3 03_COH1 
msal37119 . 2{303_M732 
msal37119.2{303_m781 
msal37119.2{303_090} 
msal37119.2{303_18RS2l} 
msa!37119 . 2 {303_2603 } 



201 

TAaGGGAGAT 
TAaGGGAGAT 
TAaGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 



ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 



CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 



TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 
TCATTTAGAA 



250 

cAtAGAAaTT 
cAtAGAAaTT 
CAtAGAAaTT 
gAcAGAAcTT 
gAcAGAAcTT 
gAcAGAAcTT 



962 



WO 2004/018646 



PCT/US2003/026827 



Table 66: Comparative Sequences relating to SAG 0754 



msal37119.2{303_A909> 
msal37119 . 2 { 303_CJB110 } 
msal37119 .2{303_H36B} 
msal37119 . 2 { 303_JM9130013 } 
msal37119 . 2 {303_1169NT} 

Consensus 



TAgGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 
TAgGGGAGAT 
TAaGGGAGAT 



ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 
ATTACAGAAG 



CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 
CTGATAAGAT 



**_******* ********** ********** 



TCATTTAGAA 

TCATTTAGAA 

TCATTTAGAA 

TCATTTAGAA 

TCATTTAGAA 
********** 



gAcAGAAcTT 

gAcAGAAcTT 

gAcAGAAcTT 

gAcAGAAcTT 

gAcAGAAcTT 
_★_**** _ * * 



msal37119 . 
msal37119 . 
msal37119. 
msal37119 
msal37119.2{ 
msal37119 . 
msal37119 . 
msal37119.2{ 
- msal37119. 
msal37119.2{303 
msal37119.2{ 



2{303_COH1) 
2{303_M732 } 
2{303_m781) 
.2{303_090} 
303_18RS2l} 
2{303_2603) 
2{303_A909} 
303_CJB110} 
2{303_H36B} 
_JM9130013 } 
303_1169NT} 
Consensus 



251 

TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 
TTGATATATT 



AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 
AATTGACTGT 



ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 
ATTGGAGCGA 



TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 
TTAAGCCCAA 



300 

TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 
TCAACTAGAT 



********** ********** ********** ********** ********** 



                            301                                                350
msal37119.2{303_COH1}       GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_M732}       GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_m781}       GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_090}        GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_18RS21}     GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_2603}       GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_A909}       GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_CJB110}     GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_H36B}       GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_JM9130013}  GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
msal37119.2{303_1169NT}     GAGCTTAACG TTAAAGCAAC CCAAAAAGCA GTAGCACTCT GTCACAAAAA
Consensus                   ********** ********** ********** ********** **********



                            351                                                400
msal37119.2{303_COH1}       TCAAATACCA AAGTTAGTTT AcATTTCAGC CAAtAGCGGC TATTCAGCTT
msal37119.2{303_M732}       TCAAATACCA AAGTTAGTTT AcATTTCAGC CAAtAGCGGC TATTCAGCTT
msal37119.2{303_m781}       TCAAATACCA AAGTTAGTTT AcATTTCAGC CAAtAGCGGC TATTCAGCTT
msal37119.2{303_090}        TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_18RS21}     TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_2603}       TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_A909}       TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_CJB110}     TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_H36B}       TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_JM9130013}  TCAAATACCA AAGTTAGTTT AtATTTCAGC CAAcAGCGGC TATTCAGCTT
msal37119.2{303_1169NT}     TCAAATACCA AAGTTAGTTT AcATTTCAGC CAAcAGCGGC TATTCAGCTT
Consensus                   ********** ********** *_******** ***_****** **********



                            401                                                450
msal37119.2{303_COH1}       ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_M732}       ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_m781}       ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_090}        ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_18RS21}     ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_2603}       ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_A909}       ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_CJB110}     ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_H36B}       ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_JM9130013}  ACATTAaAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
msal37119.2{303_1169NT}     ACATTAgAAG TAAAAGGAAG GCAGAGCAGA TAATCAAAGC AAGCGGTCTG
Consensus                   ******_*** ********** ********** ********** **********



                            451                                                500
msal37119.2{303_COH1}       GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_M732}       GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_m781}       GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_090}        GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_18RS21}     GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_2603}       GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_A909}       GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_CJB110}     GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_H36B}       GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_JM9130013}  GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
msal37119.2{303_1169NT}     GATTATCTTT TTGTAAGACC AGGTTTGATG TATGGTGAAG AGCGACCTCT
Consensus                   ********** ********** ********** ********** **********



                            501                                                550
msal37119.2{303_COH1}       CTCGATTTTC CAAGCCAAGT GTATAAAaTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_M732}       CTCGATTTTC CAAGCCAAGT GTATAAAaTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_m781}       CTCGATTTTC CAAGCCAAGT GTATAAAaTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_090}        CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_18RS21}     CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT






WO 2004/018646 



PCT/US2003/026827 



Table 66: Comparative Sequences relating to SAG 0754 



msal37119.2{303_2603}       CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_A909}       CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_CJB110}     CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_H36B}       CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_JM9130013}  CTCGATTTTC CAAGCCAAGT GTATAAAgTT ATTTAGTCAT TTGCCTTTCT
msal37119.2{303_1169NT}     CTCGATTTTC CAAGCCAAGT GTATAAAaTT ATTTAGTCAT TTGCCTTTCT
Consensus                   ********** ********** *******_** ********** **********



                            551                                                600
msal37119.2{303_COH1}       TAGGTATTGT TGTACAAAAa GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_M732}       TAGGTATTGT TGTACAAAAa GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_m781}       TAGGTATTGT TGTACAAAAa GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_090}        TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_18RS21}     TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_2603}       TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_A909}       TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_CJB110}     TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_H36B}       TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_JM9130013}  TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
msal37119.2{303_1169NT}     TAGGTATTGT TGTACAAAAg GTCTTTCCAA CTAAGGTTGT GATAGTGGCA
Consensus                   ********** *********_ ********** ********** **********



                            601                                                650
msal37119.2{303_COH1}       GAAGCAATCG TTACTtCGCT TAGGAaAAAA CCAACtCAAA AAATCCTTTC
msal37119.2{303_M732}       GAAGCAATCG TTACTtCGCT TAGGAaAAAA CCAACtCAAA AAATCCTTTC
msal37119.2{303_m781}       GAAGCAATCG TTACTtCGCT TAGGAaAAAA CCAACtCAAA AAATCCTTTC
msal37119.2{303_090}        GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_18RS21}     GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_2603}       GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_A909}       GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_CJB110}     GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_H36B}       GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_JM9130013}  GAAGCAATCG TTACTaCGCT TAGGAaAAAA CCAACcCAAA AAATCCTTTC
msal37119.2{303_1169NT}     GAAGCAATCG TTACTaCGCT TAGGAcAAAA CCAACtCAAA AAATCCTTTC
Consensus                   ********** *****_**** *****_**** *****_**** **********



                            651                  672
msal37119.2{303_COH1}       TATTGAAGAA TTAAATAATA AA
msal37119.2{303_M732}       TATTGAAGAA TTAAATAATA AA
msal37119.2{303_m781}       TATTGAAGAA TTAAATAATA AA
msal37119.2{303_090}        TATTGAAGAA TTAAATAATA AA
msal37119.2{303_18RS21}     TATTGAAGAA TTAAATAATA AA
msal37119.2{303_2603}       TATTGAAGAA TTAAATAATA AA
msal37119.2{303_A909}       TATTGAAGAA TTAAATAATA AA
msal37119.2{303_CJB110}     TATTGAAGAA TTAAATAATA AA
msal37119.2{303_H36B}       TATTGAAGAA TTAAATAATA AA
msal37119.2{303_JM9130013}  TATTGAAGAA TTAAATAATA AA
msal37119.2{303_1169NT}     TATTGAAGAA TTAAATAATA AA
Consensus                   ********** ********** **



SEQ ID NO. 6612
STRAIN 2603 frame: 1

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK

SEQ ID NO. 6613 

STRAIN 090 frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK

SEQ ID NO. 6614 
STRAIN A909 frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK

SEQ ID NO. 6615 

STRAIN H36B frame: 2 

IKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKDPRL
TYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPKLVY
ISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHLPFL
GIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK

SEQ ID NO. 6616 



STRAIN 18RS21 frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK

SEQ ID NO. 6617 

STRAIN M732 frame: 1 

QNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKDPRLTYIKGDIT
EADKIHLEHRNFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPKLVYISANSGYS
AYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHLPFLGIVVQKVF
PTKVVIVAEAIVTSLRKKPTQKILSIEELNNK



SEQ ID NO. 6618 

STRAIN COH1 frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIKGDITEADKIHLEHRNFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTSLRKKPTQKILSIEELNNK



SEQ ID NO. 6619 
STRAIN M781 frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIKGDITEADKIHLEHRNFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTSLRKKPTQKILSIEELNNK



SEQ ID NO. 6620 
STRAIN 1169NT frame: 1 

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKLAYLSRHEGKGDIFKD
PRLTYIKGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIRSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRTKPTQKILSIEELNNK



SEQ ID NO. 6621 

STRAIN CJB110 frame: 1

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK



SEQ ID NO. 6622 

STRAIN JM9130013 frame: 1

TRHIKISILNLQNEGEGTMEILIAGGSGFLGKQIIKAALTKGHKVAYLSRHEGKGDIFKD
PRLTYIRGDITEADKIHLEDRTFDILIDCIGAIKPNQLDELNVKATQKAVALCHKNQIPK
LVYISANSGYSAYIKSKRKAEQIIKASGLDYLFVRPGLMYGEERPLSIFQAKCIKLFSHL
PFLGIVVQKVFPTKVVIVAEAIVTTLRKKPTQKILSIEELNNK

PRETTY of: /biotmp/msal37299.2{*} April 10, 2003 03:37



                            1                                                   50
msal37299.2{303_COH1}       trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_M732}                  -QNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_M781}       trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_090}        trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_18RS21}     trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_2603}       trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_A909}       trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_CJB110}     trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_JM9130013}  trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_H36B}          ikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKvAYLSR
msal37299.2{303_1169NT}     trhikisiln lQNEGEGTME ILIAGGSGFL GKQIIKAALT KGHKlAYLSR
Consensus                              _********* ********** ********** ****_*****



                            51                                                 100
msal37299.2{303_COH1}       HEGKGDIFKD PRLTYIkGDI TEADKIHLEh RnFDILIDCI GAIKPNQLDE
msal37299.2{303_M732}       HEGKGDIFKD PRLTYIkGDI TEADKIHLEh RnFDILIDCI GAIKPNQLDE
msal37299.2{303_M781}       HEGKGDIFKD PRLTYIkGDI TEADKIHLEh RnFDILIDCI GAIKPNQLDE
msal37299.2{303_090}        HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_18RS21}     HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_2603}       HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_A909}       HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_CJB110}     HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_JM9130013}  HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_H36B}       HEGKGDIFKD PRLTYIrGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
msal37299.2{303_1169NT}     HEGKGDIFKD PRLTYIkGDI TEADKIHLEd RtFDILIDCI GAIKPNQLDE
Consensus                   ********** ******_*** *********_ *_******** **********






                            101                                                150
msal37299.2{303_COH1}       LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_M732}       LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_M781}       LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_090}        LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_18RS21}     LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_2603}       LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_A909}       LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_CJB110}     LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_JM9130013}  LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_H36B}       LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIkSKRKA EQIIKASGLD
msal37299.2{303_1169NT}     LNVKATQKAV ALCHKNQIPK LVYISANSGY SAYIrSKRKA EQIIKASGLD
Consensus                   ********** ********** ********** ****_***** **********



                            151                                                200
msal37299.2{303_COH1}       YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_M732}       YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_M781}       YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_090}        YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_18RS21}     YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_2603}       YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_A909}       YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_CJB110}     YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_JM9130013}  YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_H36B}       YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
msal37299.2{303_1169NT}     YLFVRPGLMY GEERPLSIFQ AKCIKLFSHL PFLGIVVQKV FPTKVVIVAE
Consensus                   ********** ********** ********** ********** **********



                            201                   223
msal37299.2{303_COH1}       AIVTsLRkKP TQKILSIEEL NNK
msal37299.2{303_M732}       AIVTsLRkKP TQKILSIEEL NNK
msal37299.2{303_M781}       AIVTsLRkKP TQKILSIEEL NNK
msal37299.2{303_090}        AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_18RS21}     AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_2603}       AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_A909}       AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_CJB110}     AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_JM9130013}  AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_H36B}       AIVTtLRkKP TQKILSIEEL NNK
msal37299.2{303_1169NT}     AIVTtLRtKP TQKILSIEEL NNK
Consensus
Table 67: Comparative Sequences relating to SAG0475



SEQ ID NO. 6701 
STRAIN 090 

CAAT AACAACATTTG AAAAT AAAAAAGTT TTAGTC CTTG GT^ 
TCTGGAGAAGCCGCTGC^CGTTTGTTAGCTAAGTTAGGAGCAATAGTGAC 
AGTTAATGATGGCAAACCATTTGATGAAAATCCMCAGCACAGTCTTTGT 
TGGAAGAGGGTATTAAAGTGGTTTGTGGTAGTCATCCTTTAGAATTGTTA 
GATGAGGATTTTTGTTACATGATTAAAAA.TCCAGGAATACCTTATAACAA 
TC CTATGGT CAAAAAAG CATTAGAAAAACAAATC CCT GTTTT GACTG AAG 
TGGAATTAG CATACTTAGTTTCAGAAT CTCAG CTAAT AGGTATTACAGG C 
TCTAACGGGAAAACGACAACGACAACGATGATTGCAGAAGTCTTAAATGC 
TGGAGGTCAGAGAGGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTG 
AAGTTGTT CAGG CTG CGGATGATAAAGAT ATT CTAGT TATGGAAT TAT CA 
AGTTTTCAG CTAAT GGGAGTTAAGG AATTT CGTC CTCAT ATTGCAGTAAT 
TACTAATTTAATGCCAACTCATTTA 

ATGTTG CTG CAAAATGGAATAT CCAAAAT CAAATGT CTT CAT CTGATTTT 

TTGGTACTTAATTTTAATCAAGGTATTTCTAAAGAGTTAGcTAAAACT 

TAAAGGRACAATCGTTCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTT 

ACGTACAAG ACAAG CAACTTTT CTATAAAGGGGAG AATATTATGTTAGTA 

GATGACATTGGTGT CCCAGGA^G CCATAACGTAG AGAATG CT CT AGCAAC 

TATTG CGGTTGCTAAACTAG CT GGTAT CAGTAATCAAGTTATT AGAGAAA 

CTTTAAGCAATTTTGGAGGTGTTAAACACCGCrrTGCA^ 

GTT CATGGTATTAGTTT CTATAACGACAG CAAGT CAACTAATATATTGG C 

AACTCAAAAAGCATTATCTGGCTTTGATAATACTAAAGTTATCCT 

CAGGAGGT CTTGAT CG CGGTAATGAGTTTGATGAATTGATAC CAGAT AT C 

ACTGGACTTAAACAT ATGGTTGTTTTAGGGGAAT CGGCAT CT CGAGT AAA 

ACGTGCTGC^CAAAAAG(^GGAGTAACITATAGCGATGCTTTAGATGTTA 

GAGATGCGGTACATAAAGCTTATGAGGTGGCACAACAGK3GCGATGTTATC 

TTGCTAAGT CCTG CAAATG CAT CATGGGACATGTATAAGAATTTCGAAGT 

CCGTGGTGATGAATTCATTGATAC t TTCGAAAGT CTTAGAGGAGAG 

SEQ ID NO. 6702 

STRAIN A909 

CAAT AACAACATTTGAAAATAAAAAAGTTTTAGTC CITC CACGA 
TCTGG AGAAGCTGCT G CACGTTTGTTAG CTAAGTT AGGAG CAATAGTGAC 
AGTTAATGATGGCAAACCATTTGATGAZ\AATCCAAC 

TGGAAGAGGGTATTAAAGTGGTTTGTGGTAGT CAT C CTTTAGAATTGTTA 

GATGAGGATTTTTGTTACATGATTAAAAATCCAGGAATACCT^ 

TC CTATGGT CAAAAAAG CATTAGAAAAACAAATCCCTGTTTTGACTGAAG 

TGGAAT/TAGCATACTTAGTTTCAGAATCTCAGCTAATAGGTATTACAG^ 

TCTAACGGGAAAACGACAACGACAACGATGATTG CAGAAGTCTTAAATG C 

TGGAGGTCAGAGAGGTTTGTTAGCTGGGAATATCGGCTTTCCTC 

AAGTTGTT CAGG CTGCGAATGATAAAGATACT CT AGTTATGGAATTATCA 

AGTTTTCAGCTAATGGGAGTTAAGGAATTTCGTCCTCATATTGCAGTAAT 

TACTAATTTAAT GCCAACT CATTTAGATT ATCATGGGTCTTTTG AAGATT 

ATGTTGCTG CAAAAT GGAATAT CCAAAAT CAAATGT CTTCATCTGATTTT 

TTGGTACITAATTTTAATCAAGGT ATTT CTAAAGAGTTAG CTAAAACTAC 

TAAAG CaACAATCGTTC CITT CTCTACTACGGAAAAAGTT GATGGTG CTT 

ACGTACAAGACAAGCAACITTTCTATAAAGGGGAGAA^ 

GATGACATTGGTGTCC CAGGAAGCCATAACGTAnAGAATG CT CTAG CAAC 

TATTGCGGTTGCTAAACTGGCTGGTATCAGTAATCAAGTTAT^ 

CTTTAAGCAATTTTGGAGGtGTTAAACAC'CGCT 

GTTCATGGTATTAGTTT CTATAACGACAGCAAGT CAACTAATATATTGG C 
AACTCAAAAAGCATTAT CTGG CTTTGATAATACTAAAGTTAT C CTAATTG 
CAGGAGGTCITG AT CG CGGTAA.TGAGTTTGATGAAT/TGAT ACCAGATATC 
ACTGGACTTAAACATATGGTTGTTTTAGGGGAAT CGG CATCT CGAGTAAA 
ACGTGCTGCACAAAAAGGAGGAGTAACTTATAGCGATGCTT^ 
GAGATGCGGT ACAT AAAGCTTATG AGGTGG CACAACAGGG CG AT GTT AT C 
TTGCTAAGTCCTGCAAATG CAT CATGGGACATGTATAAGAATTT CGAAGT 
CCGTGGTGATGAATT CATTGAT ACTTTCGAAAGT CTTAGAGGAGAG 

SEQ ID NO. 6703 
STRAIN H36B 

GGACGAGT AATGAAAACAATAACAACATTT GAAAAT 

AAAAAAGTTTTAGTC CTTGGTTTAG CACGATCTGG AGAAG CTGCTG CACG 
TTTGTTAGCTAAGTT AGGAG CAAT AGTGACAGTTAATGATGG CAAACCAT 
TTGATGAAAAT C CAACAGCACAGT CTTTGTTGGAAGAGGGTATTAAAGTG 
GTTTGTGGTAGT CAT CCTTTAGAATTGTTAGATGAGGATTTTTGTTACAT 
GATTAAAAATCCAGGAATACCTTATAACAATCCTATGGTCAAWA.GCAT 
TAGAAAAACAAATC CCTGTTTTGACTGAAGTC4GAATTAG CATACTTAGTT 
TCAGAAT CTCAG CTAAT AGGTATTACAGG CT CTAACGGGAAAACGACAAC 
GACAACGATGATTGCAGAAGTCTTAAATGCTGGAGGTCAGAGAG^ 
TAGCTGGGAATATCGGCTTTCCTG CTAGT GAAGTTGTTCAGG CTGCGAAT 
GATAAAGATACT CTAGTTATCKGAATTATCAAGTTTTCAG CTAATGGGAGT 
TAAGG AATTT CGTC CTCATATTG CAGTAATTACT AATTT AATG CCAACT C 
ATTTAGATTATCATGGGTCITTTGAAGATTATGTTGCT 
ATC CAAAAT CAAATGT CTT CAT CTG ATTTTTTGGTACTTAATTTTAATCA 
AGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAG CAACAATCGTTC CTT 
TCT CT ACTACGGAAAAAGTTGATGGTG CTTACGT ACAAGACAAG CAACTT 
TT CTATAAAGGGGAGAATATTATGT CAGT AGATGACATTGGTGT CCCAGG 
AAG C CATAACGTAGAGAATG CT CTAGCAACTATTGCGGTTGCTAAACTGG 
CTGGTAT CAGTAATCAAGTT ATTAGAGAAACTTTAAG CAATTTTGGAGGT 
GTTAAACAC CGCTTG CAAT CACTCGGT AAGGTTCATGGTATTAGTTT CTA 
TAACGACAG CAAG 



SEQ ID NO. 6704 
STRAIN 18RS21 

GGAC GAGTAATG AAAACAAT AACAACATTTG 

AAAATAAAAAAGTTTTAGTCCTTGGTTTAGCACGATCTGGAGAAGCTGCT 
GCACGTTTGTTAGCTAAGTTAGGAGCMTAGTGACAGTTAATGATGGCAA 
AC CATTT GATGAAAATC CAACAGCACAGT CTTTGTTG GAAGAGGGTATTA 
AAGTGGTTTGTGGTAGTCATCCTTTAGAATTGTTAGATGAGGATTTTTGT 
TACATGATTAAAAAT CCAG G AATAC CTTAT AACAAT C CTATGGT CAAAAA 
AG CATTAGAAAAACAAAT C C CTGTTTT GACTGAAGTGGAATT AG CATACT 
TAGT TTCAG AAT CTCAG CTAATAGGTATTACAGG CTCTAACGGG AAAACG 
ACAACGACAACG ATG ATTG CAG AAGT CTTAAATG C TGGAGGT CAG AG AGG 
TTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTGAAGTTGTTCAGGCTG 
CGAATGATAAAG AT ACT CTAGTTATGG AATTATCAAGTTTT GAG CTAATG 
GGAGTTAAGGAATTTCGTCCTCATATTGCAGTAATTACTAATTTAATGCC 
AACT CATTTAGATTATCATGGGT CTTTTGAAGATT ATGTTG CTG CAAAAT 
GGAATAT C CAAAAT CAAATGTCTT CAT CTGATTTTTTGGTACITAATTTT 
AATCAAGGT ATTTCTAAAGAGTTAG CT AAAACTACTAAAG CAACAATCGT 
TCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTTACGTACA 
AA CI ' TIU 'CTATAAAGGGGAGAATATTATGTCZAGTAGATGACATTGGTGTC 
CC^GGAAGCCATAACGTAGAGAATGCTCTAGCAACTATTGCGGTTGCTAA 
ACTGGCTGGTAT CAGTAAT CAAGTTATTAGAG AAACTTTAAGCAATTTTG 
GAGGTGTTAAACACCGCTTGCAATCACTCGGTAAGGTTCATGGTATTAGT 
TTCTATAACGACAG CAAGT CAACTAATATATTGG CAACT CAAAAAGCATT 
ATCTTGGCITTGATAATACTAAAGTTATCCrAATTGCAGGAGGTCTTGATC 
GCGGTAATGAGTTTGATGAATTGATACCAGATA^ 

ATCX3TTGTTTTAGGGGAAT CGG CAT CT CGAGT AAAACGTG CTG CACAAAA 
AG CAGGAGTAACITATAGCG ATGCTTTAGATGTTAGAGATG C GGTACAT A 
AAGCTTATGAGGTGGCACAACAGGGCGATGTTATCTCT 
AATGCATCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAATT 
CATTGATACTTTCGAAAGT CTT AGAGG AGAG 

SEQ ID NO. 6705
STRAIN M732 

GGACGAGTAATGAAAACAAT AACAACATTT GAAA 
ATAAAAAAGTTTTAGTCCTTGGTTTAGCACGATCT 

CGTTTGTTAG CTAAGTTAGGAGCAATAGTGACAGTTAATGATGG CAAAC C 
ATTTGATGAAAATCCAACAG CACAGT CTTTGTTG GAAGAGGGTATTAAAG 
TGGTTTGTGGTAGTCATCCTCTAGAATTGTTAGATGAGGATT^ 
ATGATTAAAAATCCAGGAATACCTTATAACAATCCTATGGTCAAAAAAG C 
ATTAGAAAAACAAATCC CTGTTTTGACTGAAGTGGAATTAG CATACTTAG 
TTT CAGAAT CTCAGCTAATAGGTATTACAGG CT CTAACGGGAAAACG ACA 
ACGACAACGATGATTGCAGAAGTCTTAAATGCTGGAGGTCAGAGAGGTTT 
GTTAG CTGGGAATAT CGGCTTTC CTGCTAGTGAAGTTGTT CAGG CTGCGG 
aTC^TAAAGATATTCTAGTTATGGAATTATCAAGTTTTCAGCTAATGGGA 
GTTAAGGAATTT CGT CCT CATATTG CAGTAATTACTAATTTAATG C CAAC 
TCAtTTAGATTATCATGGGTCITTTGAAGATTATGtTG 
ATATCCAAAATCAAATGT CTT CAT CTGATTTTTTGGTACT^ 
CAAGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAG CAACAaT CGTTC C 
TTTCTCTACTACGGAAAAAGTTGATGGTG CTTACGTACAAGACAAG CAAC 
TTTTCTATAAAGGGGAG AAT ATTATGT CAGTAGATGACATTGGTGTCC CA 
GGAAGCCATAACGTAGAGAATG CTCTAG CAACTATTGCX3GTTG CT 
AGCTGGTATCAGTAATCAAGTTATTAGAGAAACTTTAAGCAATTT^ 
GTGTTAAACACCG CTTG CAATCA.CTCGGTAAGGTTCATGGTATTAGTTTC 
TATAACGACAG CAAGTCAACTAATATATTGG CAACTCAAAAAGCATTATC 
TGGCTTTGATAATACTAAAGTTATC CTAATTG CAGGAGGT CTTGATCGCG 
CTAATGAGTTTGATGAATTGATACCAGATATCACTG<5ACTTAAACATATG 
GTTGTTTTAGC^a^TCGGCATCTCCAGTAAAACGTGCTGCACAAAAAGC 
AGGAGTAACTTATAG CX1ATG CTTTAGATGTTAGAGATGCGGTACATAAAG 
CTTATGAGGTGGCACAACAGGG CGATGTTATCTTG CTAAGTC CTG CAAAT 
G CAT GATGGG ACATGTATAAGAATTTCGAAGT CCGTGGTGATGAATT CAT 
TGATACTTTCGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6706 
STRAIN COH1 

GGACGAGTAATGAAAACAATAACAACATTTGA 
AAATAAAAAAGTTTTAGTCCTTGGTTTAGCACGA 
CACGTCTGCTAGCTAAGTTAGGAGCAATAGTGAGACOTAATG 
CCATTTGATGAAAATCCAACAGCACAG 

AGTGGTTTGTGGTAGTCATCCUU"lAGAATTGTTAGATGAGGAl"i"l"l"iGTT 
ACAT GATTAAAAATC CAGGAAT AC CTTAT AACAAT CCTATGGT CAAA^AA 
GCATTAGAAAAAOVAATCCCTGTTTTGACT 

AGTTT CAGAAT CTCAG CTAATAGGTATTACAGG CT CTAACGGGAAAACGA 
CAACGACAACGATGACTGCAGAAGTCTTAAATGCTGGAGGTCAGAGAGGT 
TTGTTAG CTGGGAATAT CGG CTTT C CTG CT AGTGAAGTTGTT CAGGCTG C 
GGaTGATAAAGATATTCTAGCTATGGAATTATCAAGTTTTCAGCTAATGG 
GAGTTAAGGAATTT CGTCCTCATATTG CAGTAATTACTAATTTAATG CCA 
ACT CATTTAGATTATCATGGGT CTTTTGAAGATTATGTTGCTG CAAAATG 
GAATATCCAAAATCAAATGTCTTCATCTGATTTTTTGGTACT 
ATCAAGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAGCAaCAATCGTT 
CCTITT'CTCTACTACGGAAAAAGTTGATGGTGCTTACGTACA 
ACTTTTCTAT AAAGGGG AGAATATTATGTCAGTAGATGACATTGGTGT C C 
CAGGAAG CCATAACGTAGAGAATG CT CTAG CAACT ATTG CGGTTG CT AAA 
CTAG CTGGTAT CAGTAAT CAAGTTATTAGAGAAACTTTAAG CAATTTTG G
AGGTGTTAAACACCGCTTGCA^TCACTCGGTAAGGTTCATGGTATTAGTT 
TCrATAACGACAGCAAGTCAACTAATATATTGGCAACrrCAAAAAGCATTA 
T CTGG CTTTGAT AAT ACTAAAGTTATC CT AATTG CAGGAGGT CTTGAT CG 
CGGTA^TGAGTTTGATGAATTGATACCAGATATCACTGGACTTAAACATA 
TGGTTGTTTTAGGGGAATCGGCATCTCGAGTAAAACGTGCTGCACAAAAA 
GCAGGAGTAACTTATAGCGATGCTTTAGATGTTAGAGATGCGGTACATAA 
AGCTTATGAGGTGGCACAACAGGGCGATGTTATCTTGCTAAGTCCTGCAA 
ATGCATCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAATTC 
ATTGATACTTTCGAAA 

SEQ ID NO. 6707 
STRAIN M781

GGACGAGTAATGAAAACAATAACAACATT 

TGAAAATAAAAAAGTTTTAGTCCTTGGTTTAGCACGATCTGGAGAAGCCG 
CTGCACGTTTGTTAG CTAAGTT AGGAG CAATAGT GACAGTTAATGATGG C 
AAACCATTTGATGAAAATCCAACAGCACAGTCTTTGTTGGAAGAGGGTAT 
TAA^GTGGTTTGTGGTAGTCATCCTTTAGAATTGTTAGATGAGGATTTTT 
GTTACATGATTAAAAATC CAGGAATACCTTATAACAAT C CTATGGTCAAA 
AAAG CATTAGAAAAACAAAT CC CTGTTTTG ACTGAAGTGGAATTAG CATA 
CTTAGTTTCAGAATCTCAGCTAATAGGTATTACAGGCTCTAACGGGAAAA 
CGACAACGACAACGATGATTGCAGAAGT CTTAAAT G CTG GAGGTCAGAG A 
GGTTTGTTAG CTGGG AATAT CGGCTTTC CTGCTAGTGAAGTTGTTCAGG C 
TG CGG ATGATAAAGATATTCTAGTT ATGG AATTAT CAAGTTTT CAGCTAA 
TGGGAGTTAAGGAATTTCGTCCTCATATTGCAGTAATTACTAA.TTTAATG 
CCAACTCATTTAGATTATCATGGGT C1TTTGAAGATT ATGTTG CTG CAAA 
ATGGAAT AT C CAAAATCPAATGTCTTCATOTGAl"rrri"l'GGTACTTAA^ 
TT AAT CAAGGTATTT CTAA^GAGTTAG CTAAAACTACTAAAG CAaCAAT C 
GTTC CTTTCT CTACT ACGGAAAAAGTTGATGGTGCTTACGTACAAGACAA 
GCAACTTTT'CTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGGTG 
T C CCAGGAAG CCATAACGT AGAGAATG CT CTAGCAACTATTG CGGTTGCT 
AAACTAGCTGGT AT CAGTAAT CAAGTTATTAGAGAAACTTTAAG CAATTT 
TGGAGGTGTTAA^CACCG CTTG CAAT CACT CGGTAAGGTT CATGGTATTA 
GTTTCTATAACGACAGCAAGTCAACTAATATATTGGCAA 
TTATCTGGCTTTGATAATACTAAAGTTAT C CTAATTG CAGGAGGTCTTGA 
TCG CG GT AATGAGTTTGATG AATTGATAC CAGATATCACTGG ACTTAAAC 
ATATGGTTGTTTTAgGGGAATCGGCATCrCGAGTAAAACGTGCTGCACAA 
AAAG CAGGAGT aACTT ATAG CGATG CTTTAGATGTTAGAGATG CGGTACA 
TAAAGCTTATGAGGTGGCACAACAGGGCGATGTTATCTTGCTAAGTCCTG 
CAAATGCATCATGGGACATGTATAAGAATTTCGAAGTCCGTGGTGATGAA 
IT CATTGATACTTT CGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6708 
STRAIN CJB110 

GGACGAGTAATGAAAACAATAACAACATTTGA 
AAATAAAAAAGTTTTAGTCCITGGTTTAGCACGATCT 
CACGTTTGTTAG CTAAGTTAGGAG CAATAGTGACAGTTAATGATGG CAAA 
C CATTTGATGAAAAT CCAACAG CACAGTCTTTGTTGG AAGAGGGTATTAA 
AGTGGTTTGTGGTAGTCAT C CTTTAGAATTGTTAGATGAGGATTTTTGTT 
ACATGATTAAAAATCCAGGA^TACCTTATAACAATCCTATGGTCAAAAA^ 
GCATTAGAAAAACAAATCCCTGTTTTX3ACT 

AGTTT CAGAAT CTCAGCTAATAGGTATTACAGGCT CTAACGGGAAAACGA 
CAACGACAACGATG ATTG CAGAAGT CTTAAATGCTGGAGGTCAGAGAGGT 
TTGTTAGCTGGGAATATCGGCT"lT , CCrGCIAGTGAAG , r r rGlTCAGGC , rGC 
GGATGATAAAGATATTCTAGTTATGGAATTATCAAGTTTTCAGCTAATGG 
GAGTTA^GGAATTT CGT CCT CATATTG CAGTAATTACTAATTTAATG CCA 
ACTCATTTAGATTAT CATGGGTCTTTTGAAGAATATGTTG CTGCAAAATG 
GAATATCCAAAATCAAATGTCTTCATCTGAITTTTTGGTACTTAAT^ 
AT CAAGGTATTTCTAAAGAGTTAG CTAAAACTACTAAAG CAACAAT CGTT 
CCTTTCTCTACTACGGAAAAAGTTGATGGTGCTTACGTACAAGACAAG 
ACITTTCTATAAAGGGGAGAATATTATGTTAGTAGATGAG^TTGGTGTCC 
CAGGAAG CCATAACGTAGAGA^TGCTCTAG CAACTATTG CGGTTG CT AAA 
CTAG CTGGTATCAGT AAT CAAGTTATTAGAGAAACTTTAAG CAATTTTGG 
AGGTGTTAAACACCG CTTG CAATCACT CGGTAAGGTT CATGGTATTAGTT 

tctataatgacagcaagtcaactaatatatt^ 

tctgg ctttgataat actaaagttat c ctaattg caggaggt cttgatcg 

cggtaatgagtttgatgaattgataccagatatcact 

tggttgttttaggggaatcggcatctcgagtaaaacgtgc^ 

gcaggagtaacttatag cgatg ctttagatgttagagatg cggtacataa 

agcttatgaggtggcacaacagggcc^tgttatct^ 

atgcatcatgggacatgtataagaattt cgaagt c cgtggtgatgaattc 

attgatactttcgaaagtcttagaggagag 

SEQ ID NO. 6709 
STRAIN 1169NT 

CAATAACAACATTTGAAAATAAAA^GTTTTAGT^ 

TCIGGAGAAG CCG CTGCACGTTTGTTAGCTAAGTTAGGAG CAATAGTGAC 
AGTTAATGATGGCAAACCATTTGATGAAAATCCAACAG 
TGGAAGAGGGTATTAAAGTGGTTTGTGGTAGTCATCCTTTAGAATTC 
GATGAC^ATTTTTGTTACATGATTAAAAAT CCAGGAATACCTTAT AACAA 
TCCTATGGTCAAAAAAG CATTAGAAAAACAAATCCCTGTTTTGACTGAAG 
TGGAATTAG CATACTTAGTTT CAGAATCT CAG CTAATAGGTATTACAGG C 
TCTAACGGGAAAA.CGACAACGACAACGATGATTG CAGAAGT CTTG AATG C 
TGGAGGTCAGAGAGGTTTGTTAGCTGGGAATATCGGCTTTCCTGCTAGTG
AAGTTGTTCAGGCTGCGGATGATAAAGATACTCJTAGTTATGGAATTATCA 
AGTTTTCAGCTAATGGGAGTTAAGGAATTTCX5TCCTCATATTGCAGTAAT 
TACT AATTT AATG CCAACT CATTT AGATT ATCAT G GGTCTTTTGAAGA t T 
ATG t TGCTG CAAAATGG AATAT CGAAAAT CAAATGT CTT CAT CTG ATTTT 
TTGGTACTTAATTTTAATCAAGGTATTTCTAAAGAGTTAGcTAAAACTAC 
TAAAGCAACAATCGTTCCTTTCTCTACTACGGAAAAAGTTGATGGTGCTT 
ACGTACAAGACAAGCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTA 
GACGACATTGGTGTC C CAGG AAG C CATAACGTAGAGAATG CT CT AGCAAC 
TATTGCGGTTG CTAAACTAG CTGGTATCAGTAAT CAAGTTATTAGAGAAA. 
CTTT AAG CAATTTTGGAGGTGTTAAACAC CG CTTG CAAT CACT CGGTAAG 
GTTCATGGTATTAGTTT CT ATAACG ACAGT AAGT CAACTAATATATTGG C 
AACTCAAAAAGCATTATCTGGCTTTGATAATACTAAAGTTATCCTAATTG 
CAGGAGGTCTTG AT CG CGGTAATG AGTTTGATGAATTGAT AC CAGATATC 
ACTGGACTT AAG CAT ATGGTTGTTTTAGG GG AAT CGG CAT CT CGAGT AAA 
ACGTGCTGCACAAAAAGCAGGAGTAACTTATAGCAATGCTTTAgATGTTA 
GAgATGCgGTACATAAAGCTTATGAGGTGGCACAACAGGGCGATGTTATC 
TTGTTmAGT c CTGCGAATG CAT CATGGGACATGTATAAGAATTT CGAAGT 
CCGTG GTGATGAATT CATTGATACTTT CG 

SEQ ID NO. 6710 

STRAIN JM9130013 

GGACGAGTAATGAAAACAATAACAACA 

TTTGAAAATAAAAAAGTTTTAGTCCTTGGTTTAG CACGATCTGGAGAAGC 
TG CTGCACGTTTGTTAGCTAAGTTAGGAG CAATAGTG ACAGTTAATGATG 
G CAAACCATTTGATGAAAATCCAACAG CACAGTCTTTGTTGGAAGAG GGT 
ATTAAAGTGGTTTGTGGTAGT CAT CCTTTAGAATTG t TAGATGAGGATTT 
TTGTT ACATGATT aAAAAT CCAGGAAT ACCCTAT AACAAT CCTATGGT CA 
AAAAAGCATTAGAAAAACAAAT CCCTGTTTTGACTG^ CA 
TACITAGTTTCAGAATCTCAGCTAATAGGTATTACAGGCTCT 
AACGACAACGACAACGATGATTGCAGAAGT CTTAAATGCTGGAGGTCAG A 
GAGGTrTGrT AG CTGGG AATAT CGG CTTT C CTG CTAGTGAAGTTGTT CAG 
GCTGCGAATGATAAAGATACTCTAGTTATGGAATTATCAAGTTTTCAGCT 
AATGGGAGTTAAGG AATTT CGT CCTCAT ATTG CAGTAATTACTAATTTAA 
TGCCAACTCATTTAGATTATCATGGGT CTTTTGAAGATTATGTTG CTGCA 
AAATGGAATATCCAAAATCAAATGTCTTCATCTGATTTT^ 
TTTTAATCAAGGTATTT CTAAAGAGTTAG CTAAAACTACTAAAGCaACAA 
TCGTTCCTTTCTCTACTACGGAAAAAGTTGATGGTGCITACGTACA^ 
AAGCAACTTTTCTATAAAGGGGAGAATATTATGTCAGTAGATGACATTGG 
TGTCCCAGGAAGCCATAACGTAGAGAATGCTCTAGCAACTATTGCGGTTG 
CTAAACTGG CTGGTAT CAGTAATCAAGTTATTAG AGAAACTTTAAG CAAT 
TTTGGAGGTGTTAAACACCGCTTGCAATCACTCGGT 

TAG t TTCTATAACGACAG CAAGTC1AACTAATATATTGGCAACTCAAAAAG 
CATTATCTGG CTTTGATAATACTAAAGTTATCCTAATTGCAGGAGGTCTT 
GATCGCACTAATGAGTTTGATGAACTGATACCAGATATCACTGGACTTAA 
ACATATGGTTGTTTTAGGGGAATCGGCATCTCGAGTAAAACGTGCTGCAC 
AAAAAGCAGGAGTAACTTATAG CGATGCTTTAGATGTTAGAGATG CGGTA 
CATAAAGCTTATGAGGTGG CACAACAGGGCGATGTTATCTTGCTAAGTCC 
TG CAAATGCATCATGGGACATGTATAAGAATTT CGAAGT C CGTGGTGATG 
AATT CATTGATAC t TT CGAAAGTCTTAGAGGAGAG 

SEQ ID NO. 6710
STRAIN 2603 

ggacgagtaatgaaaacaataacaacatttgaaaataaaaaagttttagt 
ccttggtttagcacgatctggagaagctgctgcacgtttgttagctaagt 
taggagcaatagtgacagttaatgatggcaaaccatttgatgaaaatcca 
acagcacagtctttgttggaagagggtattaaagtggtttgtggtagtca 
tcctttagaattgttagatgaggatttttgttacatgattaaaaatccag 
gaataccttataacaatcctatggtcaaaaaagcattagaaaaacaaatc 
cctgttttgactgaagtggaattagcatacttagtttcagaatctcagct 
aataggtattacaggctctaacgggaaaacgacaacgacaacgatgattg 
cagaagtcttaaatgctggaggtcagagaggtttgttagctgggaatatc 
ggctttcctgctagtgaagttgttcaggctgcgaatgataaagatactct 
agttatggaattatcaagttttcagctaatgggagttaaggaatttcgtc 
ctcatattgcagtaattactaatttaatgccaactcatttagattatcat 
gggtcttttgaagattatgttgctgcaaaatggaatatccaaaatcaaat 
gtcttcatctgattttttggtacttaattttaatcaaggtatttctaaag 
agttagctaaaactactaaagcaacaatcgttcctttctctactacggaa 
aaagttgatggtgcttacgtacaagacaagcaacttttctataaagggga 
gaatattatgtcagtagatgacattggtgtcccaggaagccataacgtag 
agaatgctctagcaactattgcggttgctaaactggctggtatcagtaat 
caagttattagagaaactttaagcaattttggaggtgttaaacaccgctt 
gcaatcactcggtaaggttcatggtattagtttctataacgacagcaagt 
caactaatatattggcaactcaaaaagcattatctggctttgataatact 
aaagttatcctaattgcaggaggtcttgatcgcggtaatgagtttgatga 
attgataccagatatcactggacttaaacatatggttgttttaggggaat 
cggcatctcgagtaaaacgtgctgcacaaaaagcaggagtaacttatagc 
gatgctttagatgttagagatgcggtacataaagcttatgaggtggcaca 
acagggcgatgttatcttgctaagtcctgcaaatgcatcatgggacatgt 
ataagaatttcgaagtccgtggtgatgaattcattgatactttcgaaagt 
cttagaggagag 






WO 2004/018646 



PCT/US2003/026827 



Table 67: Comparative Sequences relating to SAG0475 



MSA Alignment Results: Pretty output 

PRETTY of: /biotmp/msa30176.2{*}  April 29, 2002 02:09



                           1                                                   50
msa30176.2{305_18RS21}     ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_2603}       ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_A909}                        CAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_H36B}       ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_JM9130013}  ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_COH1}       ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_M781}       ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305e_M732}      ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_090}                         CAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_CJB110}     ggacgagtaa tgaaaaCAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
msa30176.2{305_1169NT}                      CAAT AACAACATTT GAAAATAAAA AAGTTTTAGT
Consensus                                   **** ********** ********** **********



                           51                                                 100
msa30176.2{305_18RS21}     CCTTGGTTTA GCACGATCTG GAGAAGCtGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_2603}       CCTTGGTTTA GCACGATCTG GAGAAGCtGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_A909}       CCTTGGTTTA GCACGATCTG GAGAAGCtGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_H36B}       CCTTGGTTTA GCACGATCTG GAGAAGCtGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_JM9130013}  CCTTGGTTTA GCACGATCTG GAGAAGCtGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_COH1}       CCTTGGTTTA GCACGATCTG GAGAAGCcGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_M781}       CCTTGGTTTA GCACGATCTG GAGAAGCcGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305e_M732}      CCTTGGTTTA GCACGATCTG GAGAAGCcGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_090}        CCTTGGTTTA GCACGATCTG GAGAAGCcGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_CJB110}     CCTTGGTTTA GCACGATCTG GAGAAGCcGC TGCACGTTTG TTAGCTAAGT
msa30176.2{305_1169NT}     CCTTGGTTTA GCACGATCTG GAGAAGCcGC TGCACGTTTG TTAGCTAAGT
Consensus                  ********** ********** *******_** ********** **********



                           101                                                150
msa30176.2{305_18RS21}     TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_2603}       TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_A909}       TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_H36B}       TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_JM9130013}  TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_COH1}       TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_M781}       TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305e_M732}      TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_090}        TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_CJB110}     TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
msa30176.2{305_1169NT}     TAGGAGCAAT AGTGACAGTT AATGATGGCA AACCATTTGA TGAAAATCCA
Consensus                  ********** ********** ********** ********** **********



                           151                                                200
msa30176.2{305_18RS21}     ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_2603}       ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_A909}       ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_H36B}       ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_JM9130013}  ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_COH1}       ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_M781}       ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305e_M732}      ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_090}        ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_CJB110}     ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
msa30176.2{305_1169NT}     ACAGCACAGT CTTTGTTGGA AGAGGGTATT AAAGTGGTTT GTGGTAGTCA
Consensus                  ********** ********** ********** ********** **********



                           201                                                250
msa30176.2{305_18RS21}     TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_2603}       TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_A909}       TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_H36B}       TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_JM9130013}  TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_COH1}       TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_M781}       TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305e_M732}      TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_090}        TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_CJB110}     TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
msa30176.2{305_1169NT}     TCCTTTAGAA TTGTTAGATG AGGATTTTTG TTACATGATT AAAAATCCAG
Consensus                  ********** ********** ********** ********** **********



                           251                                                300
msa30176.2{305_18RS21}     GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_2603}       GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_A909}       GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_H36B}       GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_JM9130013}  GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_COH1}       GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_M781}       GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305e_M732}      GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_090}        GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_CJB110}     GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
msa30176.2{305_1169NT}     GAATACCTTA TAACAATCCT ATGGTCAAAA AAGCATTAGA AAAACAAATC
Consensus                  ********** ********** ********** ********** **********



                           301                                                350
msa30176.2{305_18RS21}     CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_2603}       CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_A909}       CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_H36B}       CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_JM9130013}  CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_COH1}       CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_M781}       CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305e_M732}      CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_090}        CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_CJB110}     CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
msa30176.2{305_1169NT}     CCTGTTTTGA CTGAAGTGGA ATTAGCATAC TTAGTTTCAG AATCTCAGCT
Consensus                  ********** ********** ********** ********** **********



                           351                                                400
msa30176.2{305_18RS21}     AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_2603}       AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_A909}       AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_H36B}       AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_JM9130013}  AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_COH1}       AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_M781}       AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305e_M732}      AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_090}        AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_CJB110}     AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
msa30176.2{305_1169NT}     AATAGGTATT ACAGGCTCTA ACGGGAAAAC GACAACGACA ACGATGATTG
Consensus                  ********** ********** ********** ********** **********



                           401                                                450
msa30176.2{305_18RS21}     CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_2603}       CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_A909}       CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_H36B}       CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_JM9130013}  CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_COH1}       CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_M781}       CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305e_M732}      CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_090}        CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_CJB110}     CAGAAGTCTT aAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
msa30176.2{305_1169NT}     CAGAAGTCTT gAATGCTGGA GGTCAGAGAG GTTTGTTAGC TGGGAATATC
Consensus                  ********** _********* ********** ********** **********



                           451                                                500
msa30176.2{305_18RS21}     GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGaATGATA AAGATAcTCT
msa30176.2{305_2603}       GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGaATGATA AAGATAcTCT
msa30176.2{305_A909}       GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGaATGATA AAGATAcTCT
msa30176.2{305_H36B}       GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGaATGATA AAGATAcTCT
msa30176.2{305_JM9130013}  GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGaATGATA AAGATAcTCT
msa30176.2{305_COH1}       GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGgATGATA AAGATAtTCT
msa30176.2{305_M781}       GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGgATGATA AAGATAtTCT
msa30176.2{305e_M732}      GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGgATGATA AAGATAtTCT
msa30176.2{305_090}        GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGgATGATA AAGATAtTCT
msa30176.2{305_CJB110}     GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGgATGATA AAGATAtTCT
msa30176.2{305_1169NT}     GGCTTTCCTG CTAGTGAAGT TGTTCAGGCT GCGgATGATA AAGATAcTCT
Consensus                  ********** ********** ********** ***_****** ******_***



                           501                                                550
msa30176.2{305_18RS21}     AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_2603}       AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_A909}       AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_H36B}       AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_JM9130013}  AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_COH1}       AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_M781}       AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305e_M732}      AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_090}        AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_CJB110}     AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
msa30176.2{305_1169NT}     AGTTATGGAA TTATCAAGTT TTCAGCTAAT GGGAGTTAAG GAATTTCGTC
Consensus                  ********** ********** ********** ********** **********

                           551                                                600
msa30176.2{305_18RS21}     CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_2603}       CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_A909}       CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_H36B}       CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_JM9130013}  CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_COH1}       CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_M781}       CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305e_M732}      CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_090}        CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_CJB110}     CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
msa30176.2{305_1169NT}     CTCATATTGC AGTAATTACT AATTTAATGC CAACTCATTT AGATTATCAT
Consensus                  ********** ********** ********** ********** **********



                           601                                                650
msa30176.2{305_18RS21}     GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_2603}       GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_A909}       GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_H36B}       GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_JM9130013}  GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_COH1}       GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_M781}       GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305e_M732}      GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_090}        GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_CJB110}     GGGTCTTTTG AAGAaTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
msa30176.2{305_1169NT}     GGGTCTTTTG AAGAtTATGT TGCTGCAAAA TGGAATATCC AAAATCAAAT
Consensus                  ********** ****_***** ********** ********** **********



                           651                                                700
msa30176.2{305_18RS21}     GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_2603}       GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_A909}       GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_H36B}       GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_JM9130013}  GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_COH1}       GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_M781}       GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305e_M732}      GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_090}        GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_CJB110}     GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
msa30176.2{305_1169NT}     GTCTTCATCT GATTTTTTGG TACTTAATTT TAATCAAGGT ATTTCTAAAG
Consensus                  ********** ********** ********** ********** **********



                           701                                                750
msa30176.2{305_18RS21}     AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_2603}       AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_A909}       AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_H36B}       AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_JM9130013}  AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_COH1}       AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_M781}       AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305e_M732}      AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_090}        AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_CJB110}     AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
msa30176.2{305_1169NT}     AGTTAGCTAA AACTACTAAA GCAACAATCG TTCCTTTCTC TACTACGGAA
Consensus                  ********** ********** ********** ********** **********



                           751                                                800
msa30176.2{305_18RS21}     AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_2603}       AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_A909}       AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_H36B}       AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_JM9130013}  AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_COH1}       AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_M781}       AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305e_M732}      AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_090}        AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_CJB110}     AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
msa30176.2{305_1169NT}     AAAGTTGATG GTGCTTACGT ACAAGACAAG CAACTTTTCT ATAAAGGGGA
Consensus                  ********** ********** ********** ********** **********

                           801                                                850
msa30176.2{305_18RS21}     GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_2603}       GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_A909}       GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAn
msa30176.2{305_H36B}       GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_JM9130013}  GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_COH1}       GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_M781}       GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305e_M732}      GAATATTATG TcAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_090}        GAATATTATG TtAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_CJB110}     GAATATTATG TtAGTAGAtG ACATTGGTGT CCCAGGAAGC CATAACGTAg
msa30176.2{305_1169NT}     GAATATTATG TcAGTAGAcG ACATTGGTGT CCCAGGAAGC CATAACGTAg
Consensus                  ********** *_******_* ********** ********** *********_



                           851                                                900
msa30176.2{305_18RS21}     AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT
msa30176.2{305_2603}       AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT
msa30176.2{305_A909}       AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT
msa30176.2{305_H36B}       AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT
msa30176.2{305_JM9130013}  AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTgGCTGG TATCAGTAAT
msa30176.2{305_COH1}       AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT
msa30176.2{305_M781}       AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT
msa30176.2{305e_M732}      AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT
msa30176.2{305_090}        AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT
msa30176.2{305_CJB110}     AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT
msa30176.2{305_1169NT}     AGAATGCTCT AGCAACTATT GCGGTTGCTA AACTaGCTGG TATCAGTAAT
Consensus                  ********** ********** ********** ****_***** **********

                           901                                                950
msa30176.2{305_18RS21}     CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_2603}       CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_A909}       CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_H36B}       CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_JM9130013}  CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_COH1}       CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_M781}       CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305e_M732}      CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_090}        CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_CJB110}     CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
msa30176.2{305_1169NT}     CAAGTTATTA GAGAAACTTT AAGCAATTTT GGAGGTGTTA AACACCGCTT
Consensus                  ********** ********** ********** ********** **********

                           951                                               1000
msa30176.2{305_18RS21}     GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_2603}       GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_A909}       GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_H36B}       GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAG-
msa30176.2{305_JM9130013}  GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_COH1}       GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_M781}       GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305e_M732}      GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_090}        GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGcAAGt
msa30176.2{305_CJB110}     GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAt GACAGcAAGt
msa30176.2{305_1169NT}     GCAATCACTC GGTAAGGTTC ATGGTATTAG TTTCTATAAc GACAGtAAGt
Consensus                  ********** ********** ********** *********_ *****_***_

                           1001                                              1050
msa30176.2{305_18RS21}     caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_2603}       caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_A909}       caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_COH1}       caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_M781}       caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305e_M732}      caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_090}        caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_CJB110}     caactaatat attggcaact caaaaagcat tatctggctt tgataatact
msa30176.2{305_1169NT}     caactaatat attggcaact caaaaagcat tatctggctt tgataatact
Consensus

                           1051                                              1100
msa30176.2{305_18RS21}     aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_2603}       aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_A909}       aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  aaagttatcc taattgcagg aggtcttgat cgcagtaatg agtttgatga
msa30176.2{305_COH1}       aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_M781}       aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305e_M732}      aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_090}        aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_CJB110}     aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
msa30176.2{305_1169NT}     aaagttatcc taattgcagg aggtcttgat cgcggtaatg agtttgatga
Consensus

                           1101                                              1150
msa30176.2{305_18RS21}     attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_2603}       attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_A909}       attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_COH1}       attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_M781}       attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305e_M732}      attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_090}        attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_CJB110}     attgatacca gatatcactg gacttaaaca tatggttgtt ttaggggaat
msa30176.2{305_1169NT}     attgatacca gatatcactg gacttaagca tatggttgtt ttaggggaat
Consensus

                           1151                                              1200
msa30176.2{305_18RS21}     cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_2603}       cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_A909}       cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_COH1}       cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_M781}       cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305e_M732}      cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_090}        cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_CJB110}     cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
msa30176.2{305_1169NT}     cggcatctcg agtaaaacgt gctgcacaaa aagcaggagt aacttatagc
Consensus

                           1201                                              1250
msa30176.2{305_18RS21}     gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_2603}       gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_A909}       gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_COH1}       gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_M781}       gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305e_M732}      gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_090}        gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_CJB110}     gatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
msa30176.2{305_1169NT}     aatgctttag atgttagaga tgcggtacat aaagcttatg aggtggcaca
Consensus

                           1251                                              1300
msa30176.2{305_18RS21}     acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_2603}       acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_A909}       acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_COH1}       acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_M781}       acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305e_M732}      acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_090}        acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_CJB110}     acagggcgat gttatcttgc taagtcctgc aaatgcatca tgggacatgt
msa30176.2{305_1169NT}     acagggcgat gttatcttgt tmagtcctgc gaatgcatca tgggacatgt
Consensus

                           1301                                              1350
msa30176.2{305_18RS21}     ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_2603}       ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_A909}       ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_H36B}
msa30176.2{305_JM9130013}  ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_COH1}       ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaa--
msa30176.2{305_M781}       ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305e_M732}      ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_090}        ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_CJB110}     ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcgaaagt
msa30176.2{305_1169NT}     ataagaattt cgaagtccgt ggtgatgaat tcattgatac tttcg
Consensus

                           1351     1362
msa30176.2{305_18RS21}     cttagaggag ag
msa30176.2{305_2603}       cttagaggag ag
msa30176.2{305_A909}       cttagaggag ag
msa30176.2{305_H36B}       --
msa30176.2{305_JM9130013}  cttagaggag ag
msa30176.2{305_COH1}
msa30176.2{305_M781}       cttagaggag ag
msa30176.2{305e_M732}      cttagaggag ag
msa30176.2{305_090}        cttagaggag ag
msa30176.2{305_CJB110}     cttagaggag ag
msa30176.2{305_1169NT}     --
Consensus



SEQ ID NO. 6711 

STRAIN 090 frame: 3 

ITTFENKKVLVLGLARSGEAAARLLAK
HPLELLDEDFCYMIKNPGIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGITGSNGK
TTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAADDKDILVMELSSFQLMGVKEFRPHI
AVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISKELAKTTKATIVPF
STTEKVDGAYVQDKQLFYKGENIMLVDDIGVPGSHNVENALATIAVAKLAGISNQVIRET
LSNFGGVKHRLQSLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLDRGNEFD
ELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVA
ANASWDMYKNFEVRGDEFIDTFESLRGE



SEQ ID NO. 6712

STRAIN A909 frame: 3 

ITTFENKKVLVLGLARSGEAAARLLAKLGAI
HPLELLDEDFCYMIKNPGIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGITGSNGK
TTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAANDKDTLVMELSSFQLMGVKEFRPHI
AVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISKELAK^
STTEKVDGAYVQDKQLFYKGENIMSVDDIGVPGSHNVXNALATIAVAKLAGISNQVIRET
LSNFGGVKHRLQSLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLDRGNEFD
ELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGDVILLSP
ANASWDMYKNFEVRGDEFIDTFESLRGE

SEQ ID NO. 6713 

STRAIN H36B frame: 1 
grvmktittfenkkvlvlglarsge
kvvcgshplelldedfcymiknpgipynnpmvkkalekqipvltevelaylvsesqligi
tgsngktttttmiaevlnaggqrgllagnigfpasevvqaandkdtlvmelssfqlmgvk
efrphiavitnlmpthldyhgsfedyvaakwniqnqmsssdflvlnfnqgiskelakttk
ativpfsttekvdgayvqdkqlfykgenimsvddigvpgshnvenalatiavaklagisn
qviretlsnfggvkhrlqslgkvhgisfyndsk

SEQ ID NO. 6714 
STRAIN 18RS21 frame: 1 

GRVMKTITTFENKKVLVLGLARSGEAAARLLA
KVVCGSHPLELLDEDFCYMIKNPGIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGI
TGSNGKTTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAANDKDTLVMELSSF
EFRPHIAVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISK^
ATIVPFSTTEKVDGAYVQDKQLFYKGENIMSVDDIGVPGSHNVENALATIAVAKLAGISN
QVIRETLSNFGGVKHRLQSLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLD
RGNEFDELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD
VILLSPANASWDMYKNFEVRGDEFIDTFESLRGE

SEQ ID NO. 6715 
STRAIN M732 frame: 1 

GRVMKTITTFENKKVLVLGLARSGEAAARLLAKLGAIVTVNDGKPFDENPTAQSLLEEGI
KVVCGSHPLELLDEDFCYMIKNPGIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGI
TGSNGKTTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAADDKDILVMELSSFQLMGVK
EFRPHIAVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISKELAKTTK
ATIVPFSTTEKVDGAYVQDKQLFYKGENIMSVDDIGVPGSHNVENALATIAVAKLAGISN
QVIRETLSNFGGVKHRLQSLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLD
RGNEFDELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD
VILLSPANASWDMYKNFEVRGDEFIDTFESLRGE

SEQ ID NO. 6716 
STRAIN COH1 frame: 1 

GRVMKTITTFENKKVLVLGLARSGEAAARLLAKLG 

KVVCGSHPLELLDEDFCYMIKNPGIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGI
TGSNGKTTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAADDKDILVMELSSFQLMGVK
EFRPHIAVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISKELAKTTK
ATIVPFSTTEKVDGAYVQDKQLFYKGENIMSVDDIGVPGSHNVENALATIAVAKLAGISN
QVIRETLSNFGGVKHRLQSLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLD
RGNEFDELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD
VILLSPANASWDMYKNFEVRGDEFIDTFE

SEQ ID NO. 6717 

STRAIN M781 frame: 1 

GRVMKTITTFENKKVLVLGLARSGEAAARLIAXLGAIW 

KVVCGSHPLELLDEDFCYMIKNPGIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGI
TGSNGKTTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAADDKDILVMELSSFQLMGVK
EFRPHIAVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISKELAKTTK
ATIVPFSTTEKVDGAYVQDKQLFYKGENIMSVDDIGVPGSHNVENALATIAVAKLAGISN
QVIRETLSNFGGVKHRLQSLGKVHGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLD
RGNEFDELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD
VILLSPANASWDMYKNFEVRGDEFIDTFESLRGE

SEQ ID NO. 6718 

STRAIN CJB110 frame: 1

GRVMKTITTFENKKVLVIjGLARSGEAAARLIAKLGAIVTVNDGKPFDENP^ 
KWCGSHPLELLDEDFC YM I KNPG I PYNNPMVKKALE KQ I PVLTE VELAYLVSE S QL I G I 
TGSNGKTTTTTMIAEVLNAGGQRGLl^GNIGFPA 

EFRPH IAVI TNLMPTHLDYHGS FEEYVAAKWNI QNQMSS SDFLVIjNFNQG I SKELAKT^ 
AT I VP FSTTEKVDGAYVQDKQL FYKGENIMLVDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVIRETLSNFGGVKHRLQSLGKVHG I SFYNDSKSTNI LATQKALSGFDNTKVILI AGGLD 
RGNEFDELIPDITCLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKA 
VI LLS PANASWDMYKNFEVRGDEF I DTFESLRGE 

SEQ ID NO. 6719 
STRAIN 1169NT frame: 3 

ITTFENKKVLVLGLARSGEAAARLLAKLGAIVTVNDGKP FDENPTAQSLLEEG I KWCGS 
HPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVS ESQL I GITGSNGK 
TTTTTM I AEVLNAGGQRGLLAGN IGF PAS E WQAADDKDT LVME L S S FQLMGVKE FRPH I 
AVITNLMPTHLDYHGSFEDYVAAKWNI QNQMSSSDFLVLNFNQGI SKELAKTTKATIVPF 
STTE KVDGAYVQDKQLFYKGENI MS VDD I GVPGSHNVENALAT I AVAKLAG I SNQVI RET 
LSNFGGVKHRLQSLGKVHG I SFYNDSKSTN I LATQKALSGFDNTKVI L I AGGLDRGNEFD 
EL I PD I TGL KHMVVLGE S AS RVKRAAQKAGVTY S NALD VRDAVH KAYEVAQOGD V I LXS P 
ANASWDMYKNFEVRGDEFIDTF 






WO 2004/018646 



PCT/US2003/026827 



Table 67: Comparative Sequences relating to SAG0475 



SEQ ID NO. 6720 

STRAIN JM9130013 frame: 1 
GRVMKTITTFENKKVLVLX3LARSGEAA^ 

KWCGSHPLELLDEDFC YM I KNPG I P YNNPMVKKALEKQI PVLTE VELAYLVSESQL IG I 
TGSNGKTTTTTM I AE VLNAGGQRGLLAGN I GFPASEWQAANDKDTLVMELS S FQLMGVK 
EFRPH I AVI TNLMPTHLDYHGS FED YVAAKWN I QNQMS S SDFLVLNFNQG I S KELAKTTK 
AT I VPFSTTEKVDGAYVQDKQLFYKGENIMSVDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGGVKHRLQSLGKVHGI SFYNDSKSTNIiLATQKALSGFDNTKVILI AGGLD 
RSNEFDELI PDITGLKHMVW^ESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQQGD 
VI LLS PANAS WDMYKNFEVRGDEF I DTFE S LRGE 

SEQ ID NO. 6721 

STRAIN 2603 frame: 1 

GRVMKT I TT FENKKVLVLGI^ARSGEAAARLLAKLGAI VTVNDGKP FDENPTAQS LLE EG I 
KWCGSHPLELLDEDFCYMI KNPGI PYNNPMVKKALEKQI PVLTEVELAYLVSESQLIGI 
TGSNGKTTTTTM I AE VLNAGGQRGLLAGN I GFPASEWQAANDKDTLVMELS S FQLMGVK 
EFRPHIAVITNLMPTHLDYHGSFEDYVAAKMTIQNQMSSSDFLVLNFNQGISKELAKTTK 
AT I VP FSTTEKVDGAYVQD KQLFYKGEN I MS VDD I GVPGSHNVENALAT I AVAKLAG I SN 
QVI RETLSNFGG VKHRLQS LGKVHG I SFYNDSKSTNI LATQKALSGFDNTKVI LIAGGLD 
RGNEFDELI PDI TGLKHMVVLGESASRVKRAAQKAGVTYSDALDVRDAVHKAYEVAQC^D 
VI LL S PANAS WDMYKNFEVRGDEF I DTFE SLRGE 



MSA Alignment Results: Pretty output

PRETTY of: /biotmp/msa25243.2{*} April 29, 2002

1 

msa25243.2{305_18RS21} grvmktITTF ENKKVLVLGL

msa25243.2{305_2603} grvmktITTF ENKKVLVLGL

msa25243.2{305_JM9130013} grvmktITTF ENKKVLVLGL

msa25243.2{305_COH1} grvmktITTF ENKKVLVLGL

msa25243.2{305_M732} grvmktITTF ENKKVLVLGL

msa25243.2{305_M781} grvmktITTF ENKKVLVLGL

msa25243.2{305_1169NT} ITTF ENKKVLVLGL

msa25243.2{305_A909} ITTF ENKKVLVLGL

msa25243.2{305_CJB110} grvmktITTF ENKKVLVLGL

msa25243.2{305_090} ITTF ENKKVLVLGL

msa25243.2{305_H36B} grvmktITTF ENKKVLVLGL

Consensus **** ********** 



02:20



ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 

ARSGEAAARL 
********** 



LAKLGAI VTV 
LAKLGAIVTV 
LAKLGAI VTV 
LAKLGAIVTV 
LAKLGAIVTV 
LAKLGAIVTV 
LAKLGAIVTV 
LAKLGAIVTV 
LAKLGAIVTV 
LAKLGAIVTV 
LAKLGAIVTV 
********** 



50 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 

NDGKPFDENP 
********** 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



51 

TAQSLLEEGI 
TAQSLLEEG I 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 
TAQSLLEEGI 



KWCGSH PLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 
KWCGSHPLE 



********** ********** 



LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 

LLDEDFCYMI 
********** 



KNPGI PYNNP 
KNPGI PYNNP 
KNPGI PYNNP 
KNPG I PYNNP 
KNPG IP YNNP 
KNPG I PYNNP 
KNPG I PYNNP 
KNPGI PYNNP 
KNPG I PYNNP 
KNPGI PYNNP 
KNPGI PYNNP 



100 

MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 
MVKKALEKQI 



********** ********** 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}

Consensus 



101 

PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 
PVLTEVELAY 



LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 
LVSESQLIGI 



********** ********** 



TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 

TGSNGKTTTT 
********** 



TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 

TMIAEVLNAG 
********** 



150 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 

GQRGLLAGNI 
********** 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



151 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 

GFPASEWQA 
********** 



AnDKDtLVME 

AnDKDtLVME 

AnDKDtLVME 

AdDKDi LVME 

AdDKDiLVME 

AdDKDiLVME 

AdDKDtLVME 

AnDKDtLVME 

AdDKDiLVME 

AdDKDiLVME 

AnDKDtLVME 
*-***_**** 



LSS FQLMGVK 

LSS FQLMGVK 

LSS FQLMGVK 

LSS FQLMGVK 

LSSFQLMGVK 

LSS FQLMGVK 

LSSFQLMGVK 

LSSFQLMGVK 

LSSFQLMGVK 

LSSFQLMGVK 

LSSFQLMGVK 
********** 



EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 
EFRPHIAVIT 

EFRPHIAVIT 
********** 



200 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 

NLMPTHLDYH 
********** 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



201 

GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 
GSFEeYVAAK 
GSFEdYVAAK 
GSFEdYVAAK 



WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 
WNIQNQMSSS 



DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 
DFLVLNFNQG 



****_***** ********** ********** 



ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 

ISKELAKTTK 
********** 



250 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 

ATIVPFSTTE 
********** 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



251 

KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 
KVDGAYVQDK 



QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 
QLFYKGENIM 



********** ********** 



sVDDIGVPGS 

SVDDIGVPGS 

SVDDIGVPGS 

sVDDIGVPGS 

sVDDIGVPGS 

SVDDIGVPGS 

SVDDIGVPGS 

SVDDIGVPGS 

1VDDIGVPGS 

1VDDIGVPGS 

sVDDIGVPGS 
_ ********* 



HNVeNALAT I 

HNVeNALAT I 

HNVeNALAT I 

HNVeNALATI 

HNVeNALAT I 

HNVeNALATI 

HNVeNALAT I 

HNVxNALATI 

HNVeNALATI 

HNVeNALATI 

HNVeNALATI 
***_****** 



300 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 

AVAKLAGISN 
********** 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



301 

QVI RETLSNF 
QVIRETLSNF 
QVI RETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 
QVIRETLSNF 



GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 
GGVKHRLQSL 



********** ********** 



GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 

GKVHGISFYN 
********** 



DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSKstnilat 

DSK 

*** - 



350 

qkalsgfdnt 
qkalsgf dnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 
qkalsgfdnt 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus 



351 

kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 
kviliaggld 



rgnefdelip 
rgnef delip 
rsnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 
rgnefdelip 



ditglkhmw 
ditglkhmw 
ditglkhmw - 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 
ditglkhmw 



lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 
lgesasrvkr 



400 

aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 
aaqkagvtys 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



401 

daldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 
naldvrdavh 
daldvrdavh 
daldvrdavh 
daldvrdavh 



kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 
kayevaqqgd 



vi lisp anas 
villspanas 
villspanas 
villspanas 
villspanas 
villspanas 
vilxspanas 
villspanas 
villspanas 
villspanas 



wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 
wdmyknf evr 



450 

gdef idtf es 
gdef idtf es 
gdef idtf es 
gdef idtf e~ 
gdef idtf es 
gdef idtf es 
gdef idtf — 
gdef idtf es 
gdef idtf es 
gdef idtf es 



msa25243.2{305_18RS21}
msa25243.2{305_2603}
msa25243.2{305_JM9130013}
msa25243.2{305_COH1}
msa25243.2{305_M732}
msa25243.2{305_M781}
msa25243.2{305_1169NT}
msa25243.2{305_A909}
msa25243.2{305_CJB110}
msa25243.2{305_090}
msa25243.2{305_H36B}
Consensus



451 
lrge 
lrge 
lrge 

lrge 
lrge 

mm 

lrge 
lrge 
lrge 
Table 68: Comparative Sequences relating to SAG0499



SEQ ID NO. 6801 
STRAIN 2603 

ATGGCTAAAGAGAGGGTAGATGTTCTTGCCTATAAA.CAGGGACTTTTTGATACACGAGAG 
CAAGCGAAACGTGGTGTTATGGCAGGAATGGTGATTAACGTTATCAATGGAGAACGTTAT 
GATAAACCAGGTGAAAAGGTTGCAGACGATACrGAATTAAAACTAAAAGGTGAAAAACTA 
AAATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACA^ 

GTTGCAGATAAGCTAACTATAGATATTGGCGCCTCTACGGGTGGTTTTACTGATGTTATG 
CTACAAT CAGGAGCG CGTTT AGTTTACGCAGTAG ATGTAGGAACAAAT CAATTAGTTTG G 
AAGTTACGTCAGGAT CATCGTGTT CGTT CTATGG AACAAT ATAATTTT AGGTATG C C CAA 
AAAGAAGATTT CAAGGAGGGACTG CCTGAATTTG CAT CG ATAGATGT CTCATTTAT CTCT 
CTTAATTTG ATTTTACCAG CT CTAAAAGAAATTTT AGTGGATGGTGG ACAAGTAGTG G CA 
TTAATTAAACGA.CAATTTGAAGCAGGTCGTGAGGAAATC 
G^CAAGTTGGTTCATGAAAAGGTTTTGACAACAGTGA 
TATACGGTTAAACATCTTGATTTTTCGCCCATTCAAGGTGGA^ 

TTAATG CAT TTG CAAAAGTGT CAAG AT C CACAAAAT CTT GTG CTT GAC CAAAT ACAAGAT 
GTTAT AGAAAAAGCACATAAGGAAT TTAAGAAAAATG AAG AAGAG 

SEQ ID NO. 6802
STRAIN 090 

GCTAAAGAGAGGGTAGATGTTCTTGCCT 

ATAAACAGGGACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATG 

GCAGGAATGGTGATTAACGTTATCAATGGAGAACGTTATGATAaACCAGG 

TGAAAAGGTTGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAA 

AATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTT^ 

GAAATTT CAGTTG CAGATAAG CTAACTATAGATATTGG CGCCTCTACGGG 

TGGTTTTACTGATGTTATG CTACAATCAG GAG CGCGTTTAGTTTACG CAG 

TAGATGTAGGAA(2AAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGT 

GTTCGTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGATTT 

CAAGGAGGGACTGCCTGAATTTGCATCGATAGATGTCTC^ 

TTAATTTGATTTTACCAGCT CTAAAAG A 

GTAGTGGCATTAATTAAACCACAATTTGAAGC^GGTCGTGAGCAAATTGG 

TAAAAATGGTATTGT CAAAGACAAGTTGGTTGATGAAAAG GTTTTGACAA 

CAGTGAC CAATTTCA.CX5AAAGATTATGGATATACGGTTAAACAT CTTGAT 

TTTTCGCCCATTCAAGGTGGACATGGAAATATTGAGTTTTTA^ 

G CAAAAGTGT CAAGAT C CACAAAAT CTTGTGCTTGAC CAAATACAAGATG 

TTATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6803
STRAIN A909 

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGAC"I"1T , TTGATACACGAGAGCAAGCGAAACGTGGTGTTATGG 

CAGGAATGGTGATTAACGTTAT CAATGGAG AA.CX3TTATGATAAAC CAGGT 

GAAAAGGTTGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAA 

ATATGTTAGTAGAGGTGGATT<3AAATTAGAAAAAGC 

AAATTTCAGTTG CAGATAAG CTAACTATAGATATTGG CG C CTCTACGGGT 

GGTTTTACTGATGTTATG CTACAATCAGGAGCG CGTTTAGTTTACG CAGT 

AGATGTAGGAACAAAT CAATTAGTTTGGAAGTTACGTCAGGATCAT CGTG 

TTCGTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAA 

AAGGAGGGACTGCCTGAATTTGCATCGATAGATC 

TAATTTGATTTTACCAGCTCTAAAAGAAATTTTAGTGGA 

TAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCX3TGAGCAAATTGGT 

AAAAATGGTATTGT CAA^GACAAGTTGGTTCATG AAAAGGTTTTG ACAAC 

AGTGACCAATTTCAGGAA^GATTATGGATATACGGTTAAAC^ 

TTTCG CCCATTCAAGGTGGACATGGAAATATTGAGTTTTTAATG CATTTG 

CAAAAGTGT CAAGAT CCACAAAAT CTTGTG CTTGACCAAATACAAGATGT 

TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6804
STRAIN H36B 

GCTAAAGAGAGGGTAGATGTTCTTGCCTATAAACAGG 

GACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATGGCAGGAATG 

GTGATTAACGTTAT CAATGG AGAACGTTATGATAAAC CAGGTGAAAAGGT 

TGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACT 

GTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTTC 

GTTG CAGATAAG CTAACTATAG ATATTGG CG C CT CTACGGGTGGTTTTAC 

TGATGTTATGCTACAATCAGG^GCGCGTTTAGTTTACGCAGTAGATGTAG 

GAACAAATCAATTAGTTTGGAAGTTACGT CAGGAT CATCGTGTTCGTT CT 

ATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGATTTCAAGGAGGG 

ACTG CCTGAATT^G CATCG ATAGATGTCT CATTTATCTTCT CTTAATTTGA 

TTTTACCAGCTCTAAAAGAAATTTTAGTGGATGGTGGAC 

TTAATTAAACCACAATTTCAAGCAGGTCGTGAGCAAATTGGTA 

TATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTTC 

ATTT CACGAAAGATTATGGATATACGGTTAAACATCT^GATTTTTCG CCC 

ATTCAAGGTGGACATGGAAATATTGAGTTTTTAATGCATTTG CAAAAGTG 

TCAAGATCCACAAAATCTTGTGCTTGACCAAATACAAGATGTTATAGAAA 

AAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6805

STRAIN 18RS21 

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATGG 
CAGGAATGGTGATTAACGTTAT CAATGGAG AACGTTATGATAAAC CAGGT 






GAAAAGGTTG CAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAAA 
ATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAG CTTTACAAGTTTTTG 
AAATTTCAGTTG CAGATAAGCTAACTATAGAT ATTGG CG C CT CTACGGGT 
GGTTTTACTGATGTTATGCTACAATCAGGAGCGCGTTTAGTTTACGCAGT 
AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGTG 
TT CGTTCTATGGAACAATATAATTTTAGGTATG CCCAAAAAGA^GATTT C 
AAGGAGG GACTGCCTGAATTTGCAT CG ATAGATGT CT CATTTAT CTCTCT 
TAATTTG ATTTT AC CAG CT CTAAAAGAAATTTTAGTGGATGGTGG ACAAG 
TAGTGGCATTAATTAAACCACAATTTGAAG CAGGT CGTG AG CAAATTGGT 
AAAAATGGTATTGTCAAAGACAAGTTGGTT CATGAAAAGGTTTTGACAAC 
AGTGACCAATTT GACGAAAG ATTATGG ATATACGGITAAACATCTTGAT T 
TTTCGCCC^TTCAAGGTGGACATGGAAATATTGAGTTT^ 
CAAAAGTGT CAAGAT C CACAAAAT CTTGTG CTTGACCAAATACAAGATGT 
TATAGAAAAAGCACATAAG G AATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6806 
STRAIN M732 

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGACTTTTTGATACACGAGAG CAAG CGAAACGTGGTGTTATGG 
CAGGACTGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGGC 
GAAAAGGTTGCAGACGATACTCAATTAAAACTA^ 

ATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTTG 

AAATTTCAGTTGCAGATAAG CTAACTATAGAT ATTGG CG C CT CTACGGGT 

GGTTTTACTGATGTTATGCTACAATCAGGAGCGCGTTTAGTTTACG 

AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGT CAGGAT CAT CGTG 

TTCGTTCHATGGAACAATATAATTTTAGGTATGC CCA C 

AAGGAGGGACTG CCTGAATTTG CATCGATAGATGT CTCATTTATCTCT CT 

TAATTTGATTTTAC CAG CTCTAAAAGAAATTTTAGTGGATGGTGGACAAG 

TAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAATTGGT 

AAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTTG^ 

AGTGACCAATTT CACGAAAGATTATGGATATACXMTTAAACATCTTG ATT 

TTTCGCCCGTTCAAGGTGGACATGGAAATATTGAGTT^ 

CAAAAGTGT CAAGAT CCACAAAAT CTTGTG CTTGACCAAATACAAGATGT 

TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6807
STRAIN COH1 

GCTAAAGAGAGGGTAGATGTTCTTG CCT 

ATAAACAGG GACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATG 

GCAGGACTGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGG 

CGAAAAGGTTGCAGACX^TACTGAATTAA^CTAAAAG^ 

AATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTT^ 

GAAATTT CAGTTGCAGATAAGCTAACTATAGATATTGG CG CCTCTACGGG 

TGGTTTT ACTGATGTTATG CTACAATCApGAG CG CGTTTAGTTTACG CAG 

TAGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGT 

GTTCGTT CTATGGAACAATATAATTTTAGGTATG C CCAAAAAGAAGATTT 

CAAGGAGGGACTGCCTGAATTTGCATCGATAGATGTCTCATTTATOT 

TTAATTTGATTTTACCAGCTCTAAAAGAAATTTTAGTGGATC 

GTAGTGGCATTAATTAAACCACAATTTGAAGCAGGTCGTGAGCAAATTGG 

TAAAAATGGTATTGT CAAAGACAAGTTGGTT CA.TGAAAAGGTTTTGACAA 

CAGTGACCAA.TTTCACGAAAGATTATGGATATACGGTTAAACAT CTTGAT 

TTTTCGCCCGTTCAAGGTGGACATGGAAATATTa^GTTTTTAAT 

GCAAAAGTGTCAAGATCCACAAAATCITGTGCITGA 

TTATAGAAAAAGC^CATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6808 
STRAIN M781 

G CTAAAGAGAGGGTAGATGTTCTTG CCT 

ATAAACAGGGACTTTTTGATACACGAG AG CAAG CGAAACGTG GTGTTATG 
GCAGGACTGGTGATTAACX5TTATCAATGGAGAACGTTATGATAAACCAGG 
CGAAAAGGTTGCAGACGATACTGAATTAAAACTAAAAGGTGAAAAACTAA 
AATATGTTAGTAGAGGTGGATTGAAATTAGAAAAAGCTTTACAAGTTTTT 
GAAATTT CAGTTGCAGATAAGCTAACTATAGATATTGGCG CCTCTACGGG 
TG GTTTTACTGATGTTATGCTACAAT CAGGAGCGCGTTTAGTTTACG CAG 
TAGATGTAGGAAfCAAATCAATTAGTTTGGAAGTTO 
GTTCX3TTCTATGGAACAATATAATTTTAGGTATC 

CAAGGAGGGACTGCCTCAATTTGCATCGATAGATGTCTCATTTATCTCT 
TTAATTTGATTTTACCAGCTCTAAAAGAAATTT^ 

GTAGTGG CATTAATTAAAC CACAATTTGAAGCAGGTCGTGAG CAAATTGG 
TAAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTT^ 
CAGTGACCAATTT(^CGAAAGATTATGGATATACGGTTAAACATCTTGAT 
TTTT CGCCCGTTCAAGGTGGACATGGAAATATTGAGTTTTTAATG CATTT 
GCAAAAGTGT CAAG ATC CACAAAATCTTGTGCTTGAC CAAATACAAGATG 
TTAT AGAAAAAG CACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6809 
STRAIN CJB110

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGACTTTTTGAT ACACGAGAGCAAG CGAAACGT GGTGTTATGG 
CAGGAATGGTGATTAACGTTATCAATGGAGAAICGTTATGATAAACCAGGT 
GAAAAGGTTG CAGACGATACTGAATTAAAACTAAAAG<5TGAAAAACTAAA 
ATATGTTAGTAG AGGTGGATTGAAATTAGAAAAAG CTTTACAAGT TT TT G 
AAATTTCAGTTG CAGAT AAG CTAACTATAGATATTGG CGC CTCTACGGGT 



GGTTTTACTGATGTTATGCTACAATCAGGAGCGCGTTTAGTTTACGCAGT
AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGTG 
TTCGTTCTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGATTTC 
AAGG AGGGACT G CCTGAATTTG CAT CGATAGATGTCT CATTTAT CT CT CT 
TAATTTGATTTTACCAGCTCTAAAAGAAATTTTAGTGGATGGTGGACAAG 
TAGTGGCATTAATTAAACGACAATTTGAAGCAGGTCGTGAGCAAATTGGT 
AAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTTGACAAC 
AGTGACCAATTTCACGAAAGATTATGGATATACGGTTAAACATCTTGATT 
TTT CG CC CATT CAAGGTGGACATGG AAAT ATTGAGTTTTTAATG CATTTG 
CAAAAGTGT CAAGATCCACAAAATCITGTGCTTGACCAAATACAAGATGT 
TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6810 
STRAIN 1169NT 

G CTAAAGAG AGGGTAGATGTT C TTG CCTA 

TAAACAGGGACTTTTTGATACACGAGAGCAAGCGAAACGTGGTGTTATGG 

CAGGACTGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGGC 

GAAAAGGTTGCAGACGATACrGAATTAAAACTAAAAGGTGAAAAACTAAA 

ATATGTTAGTAGAGGTGGATTGA^TTAGAAAAAGCTTTACAAGTTTTTG 

AAATTT CAGTTG CAGATAAG CTAACTATAGATATTGG CG C CT CTACGGGT 

GGTTTTACTGATGTT ATG CTACAAT CAGGAG CGCGTTTAGTTTACGCAGT 

AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGATCATCGTG 

TTCGT^CTATGGAACAATATAATTTTAGGTATGCCCAAAAAGAAGA 

AAGGAGGGACTGCCTGAATTTG CAT CGATAGATGT CTQ^TTTATCTCTCT 

TAATTTGATTTTGCCAGCTCTAAAAGAAATTTTAGTGG^ 

TAGTGG CATTAATTAAACCACAATTTGAAG CAGGTCGTGAGCAAATTGGT 

AAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAGGTTTT^ 

AGTGACCAATTTCACGAAAGATTATGGATATACGGTTAAACATCTTGATT 

TTTCGCCCATTCAAGGTGGACATGGAAATATTGAGTTTTTAATG 

CAAAAGTGT CAAGATCCACAAAATCTTGTGCTTGACCAAA 

TATAGAAAAAG CACATAAGGAATTTAAGAAAAATGAAGAAGAG 

SEQ ID NO. 6811 
STRAIN JM9130013

GCTAAAGAGAGGGTAGATGTTCTTGCCTA 

TAAACAGGGACTTTTTGATACA.CGAGAGCAAGCGAAACGTGGTGTTATGG 
CAGGAATGGTGATTAACGTTATCAATGGAGAACGTTATGATAAACCAGGT 
GAAAAGGTTGC^GACGATACTGAATTAAAACTAZVAAGGTGAAAA^CTAAA 
ATATGTT AGTAGAGGTG GATTG AAATTAG AAAAAG CTTT ACAAGTTTTT G 
AAATTTCAGTTG CAGATAAGCTAACTATAGATATTGG CG C CT CTACGGGT 
GGTTTTACTGATGTTATGCTACAATCAGGAGCGCGTTTAGTTTACGCAGT 
AGATGTAGGAACAAATCAATTAGTTTGGAAGTTACGTCAGGAT CAT CGT G 
TTCGTTCTATGGAACMTATAATTTTAGGTATGCCCAAAAAGAAGATT^ 
AAGGAGGGACTGCCTGAATTTG CAT CX3ATAGATGTCTCATTTATCTCTCT 
TAATTTGATTTTAC CAG CT CTAAAAGAAATTTTAGTGGATGGTGG ACAAG 
TAGTGGCATTAATTAAACCACAATTTGAAG CAGGTCGTGAGCAAATTGGT 
AAAAATGGTATTGTCAAAGACAAGTTGGTTCATGAAAAG^ 
AGTGACCAATTTC^CGAAAGATTATGGATATACGGTTAAACIATCTTGATT 
TTTCGCC CATTCAAGGTGGACATGG AAATATTGAGTTTTTAATGC^ 
CAAAAGTGT CAAGAT CCACAAAAT CTTGTG CTTGAC CAAATACAAGATGT 
TATAGAAAAAGCACATAAGGAATTTAAGAAAAATGAAGAAGAG 
msa236683
msa236683 -2{ 

msa236683 . 

nnsa236683 . 
msa236683 . 2{ 

msa236683 . 
msa236683 .2(310 
GCTAAAG

GCTAAAG 

atgGCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

GCTAAAG 

********** 



AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 

AGAGGGTAGA 
********** 



TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 

TGTTCTTGCC 
********** 



TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 

TATAAACAGG 
********** 



50 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 

GACTTTTTGA 
********** 



msa236683 
msa236683 .2{ 

msa236683 . 

rasa236683 . 
msa236683 .2 { 

msa23 6683 . 
msa236683 .

msa236683 . 
51

TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 
TACACGAGAG 



CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 
CAAGCGAAAC 



GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 
GTGGTGTTAT 



GGCAGGAaTG 
GGCAGGAaTG 
GGCAGGAaTG 
GGCAGGAaTG 
GGCAGGAaTG 
GGCAGGAaTG 
GGCAGGAaTG 
GGCAGGAcTG 
GGCAGGAcTG 
GGCAGGAcTG 
GGCAGGAcTG 



********** ********** ********** *******_** 



100 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 

GTGATTAACG 
********** 
msa236683
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 . 2{ 

msa236683 . 
msa236683 .2(310 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2{ 



.2{310_090} 
310_18RS2l} 
2{310_2603} 
2{310_A909) 
2{310_H36B}
Consensus



101 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 

TTATCAATGG 
********** 



AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 

AGAACGTTAT 
********** 



GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 

GATAAACCAG 
********** 



GtGAAAAGGT 

GtGAAAAGGT 

GtGAAAAGGT 

GtGAAAAGGT 

GtGAAAAGGT 

GtGAAAAGGT 

GtGAAAAGGT 

GcGAAAAGGT 

GcGAAAAGGT 

GcGAAAAGGT 

GcGAAAAGGT 
*_******** 



150 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 

TGCAGACGAT 
********** 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 .2{ 

msa236683 . 
msa236683 .2(310 

msa23 S683 . 

msa236683 . 

msa236683 . 
msa236683 ,2{ 



.2{310_090) 
310_18RS2l} 
2(310_2603} 
2{310~A909" 
Consensus



151 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 

ACTGAATTAA 
********** 



AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 

AACTAAAAGG 
********** 



TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 
TGAAAAACTA 



AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 
AAATATGTTA 



200 

GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 
GTAGAGGTGG 



********** ********** ********** 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 .2 { 

msa236683 . 
msa236683 .2(310 

msa236683 . 

msa236683 . 

msa236683 . 
.2{310_090
310_18RS21 
201

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 

ATTGAAATTA 
********** 



GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 

GAAAAAGCTT 
********** 



TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 

TACAAGTTTT 
********** 



TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 
********** 



250 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 

GTTGCAGATA 
********** 



msa23668 
msa236683 .2 
msa236683 
msa236683 
msa236683 .2 
msa236683 
msa236683 .2(31 
msa236683 
3 .2(310_090"

(310_18RS21 : 

.2{310_2603 

.2(310_A909 

(310_CJB110' 

.2{310_H36B 
Consensus



251 

AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 
AGCTAACTAT 



AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC 

AGATATTGGC. 

AGATATTGGC 



GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 
GCCTCTACGG 



GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 
GTGGTTTTAC 



300 

TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 
TGATGTTATG 



********** ********** ********** ********** ********** 



msa236683 
msa236683 . 2( 
301

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 

CTACAATCAG 
********** 



GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 

GAGCGCGTTT 
********** 



AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 

AGTTTACGCA 
********** 



GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 

GTAGATGTAG 
********** 



350 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 

GAACAAATCA 
********** 
351

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 

ATTAGTTTGG 
********** 



AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 

AAGTTACGTC 
********** 



AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 

AGGATCATCG 
********** 



TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 

TGTTCGTTCT 
********** 



400 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 

ATGGAACAAT 
********** 
msa236683
401

ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 
ATAATTTTAG 



GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 
GTATGCCCAA 



AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 
AAAGAAGATT 



TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 
TCAAGGAGGG 



450 

ACTGC CTGAA 
ACTGCCTGAA 
ACTGC CTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 
ACTGCCTGAA 



********** ********** ********** ********** ********** 



msa23668 
msa236683 .2 
msa236683 
msa236683 
msa236683 .2 
msa236683 .2(31
451

TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 
TTTGCATCGA 



TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 
TAGATGTCTC 



ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT ' 

ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT 

ATTTATCTCT 



CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 
CTTAATTTGA 



500 

TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTaCCAGC 
TTTTgCCAGC 



********** ********** ********** ********** ****_***** 
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msa236683 . 

msa236683 . 
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msa236683 . 
msa236683 .2(310 
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310_18RS2l} 
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501 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 

TCTAAAAGAA 
********** 



ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 
ATTTTAGTGG 



ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 
ATGGTGGACA 



********** ********** 



AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 

AGTAGTGGCA 
********** 



550 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 

TTAATTAAAC 
********** 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 . 2{ 

msa236683 . 
msa236683 .2(310 
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msa236683 . 
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2{310_M781} 
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551 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 

CACAATTTGA 
********** 



AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 

AGCAGGTCGT 
********** 



GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 

GAGCAAATTG 
********** 



GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 

GTAAAAATGG 
********** 



600 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 

TATTGTCAAA 
********** 



msa236683 
msa236683 .2{

msa236683 . 

msa236683 . 
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msa236683 . 
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msa236683. 
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601 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 

GACAAGTTGG 
********** 



TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 

TTCATGAAAA 
********** 



GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 

GGTTTTGACA 
********** 



ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 

ACAGTGACCA 
********** 



650 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 

ATTTCACGAA 
********** 



msa23668 
msa236683 .2 
msa236683 
msa236683 
msa236683 .2 
msa236683 
msa236683 .2(31 
msa236683 
msa236683 
msa236683 
msa236683 .2 



3 .2(310^090} 
(310_18RS21} 
.2{310_2603*} 
.2{310_A909} 
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651 

AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 
AGATTATGGA 



TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 
TATACGGTTA 



AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 
AACATCTTGA 



TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 
TTTTTCGCCC 



700 

aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
aTTCAAGGTG 
gTTCAAGGTG 
gTTCAAGGTG 
gTTCAAGGTG 
aTTCAAGGTG 
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Consensus ********** ********** ********** ********** _********* 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 .2{

msa236683 . 
msa236683.2{310 
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msa236683 . 
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701 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 

GACATGGAAA 
********** 



TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 

TATTGAGTTT 
********** 



TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 

TTAATGCATT 
********** 



TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 

TGCAAAAGTG 
********** 



750 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 

TCAAGATCCA 
********** 



msa236683 
msa236683 .2{ 

msa236683 . 

msa236683 . 
msa236683 .2 { 

msa236683 .
msa236683.2{310 

msa236683 . 

msa236683 . 

msa236683 . 
msa236683 .2{ 



.2{310_090} 
310_18RS21; 
2{310_2603 
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310_CJB110 
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751 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 

CAAAATCTTG 
********** 



TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 

TGCTTGACCA 
********** 



AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 
AATACAAGAT 



GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 
GTTATAGAAA 



********** ********** 



800 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 

AAGCACATAA 
********** 



msa236683.2{310_090}
msa236683.2{310_18RS21}
msa236683.2{310_2603}
msa236683.2{310_A909}
msa236683.2{310_CJB110}
msa236683.2{310_H36B}
msa236683.2{310_JM9130013}
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801 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 

GGAATTTAAG 
********** 



AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 

AAAAATGAAG 
********** 



825 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 

AAGAG 
***** 



SEQ ID NO. 6812
STRAIN 2603 frame: 1

MAKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6813
STRAIN 090 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6814
STRAIN A909 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6815
STRAIN 18RS21 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6816
STRAIN M732 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGLVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
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IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPVQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6817
STRAIN COH1 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGLVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPVQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6818
STRAIN M781 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGLVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPVQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6819
STRAIN CJB110 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6820
STRAIN 1169NT frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGLVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6821
STRAIN JM9130013 frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

SEQ ID NO. 6822
STRAIN H36B frame: 1

AKERVDVLAYKQGLFDTREQAKRGVMAGMVINVINGERYDKPGEKVADDTELKLKGEKLK
YVSRGGLKLEKALQVFEISVADKLTIDIGASTGGFTDVMLQSGARLVYAVDVGTNQLVWK
LRQDHRVRSMEQYNFRYAQKEDFKEGLPEFASIDVSFISLNLILPALKEILVDGGQVVAL
IKPQFEAGREQIGKNGIVKDKLVHEKVLTTVTNFTKDYGYTVKHLDFSPIQGGHGNIEFL
MHLQKCQDPQNLVLDQIQDVIEKAHKEFKKNEEE

PRETTY of: /biotmp/msa236800.2{*} May 14, 2003 02:58 ..
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msa23680 
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msa236800 
msa236800 
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msa236800 
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-AKERVDVLA 

-AKERVDVLA 

mAKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 

-AKERVDVLA 
********** 



YKQGLFDTRE
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 
YKQGLFDTRE 



QAKRGVMAGm 
QAKRGVMAGm 
QAKRGVMAGm 
QAKRGVMAGm 
QAKRGVMAGm 
QAKRGVMAGm 
QAKRGVMAGm 
QAKRGVMAGl
QAKRGVMAGl 
QAKRGVMAGl 
QAKRGVMAGl 



********** *********_ 



VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 

VINVINGERY 
********** 



50 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 

DKPGEKVADD 
********** 



msa236800 
msa236800 .2 { 

msa236800 . 

msa236800 . 
msa236800 .2 { 

msa236800 . 
msa236800 .2(310 

msa236800 . 

msa236800 . 

msa236800 . 
msa236800 .2 { 
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51 

TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 
TELKLKGEKL 



KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 
KYVSRGGLKL 



EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 
EKALQVFEIS 



VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 
VADKLTIDIG 



100 

ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
ASTGGFTDVM 
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msa236800 
msa236800 .2{

msa236800 . 

msa236800 . 
msa236800.2{ 

msa236800 .
msa236800 .2(310 

msa236800 . 

msa236800 . 

msa236800 . 
msa236800 .2{ 



.2{310_090; 
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Consensus 



LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA 
LQSGARLVYA



VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 
VDVGTNQLVW 



KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 
KLRQDHRVRS 



MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 
MEQYNFRYAQ 



KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 
KEDFKEGLPE 



********** ********** ********** ********** ********** 



msa236800.2{310_090}
msa236800.2{310_18RS21}
msa236800.2{310_2603}
msa236800.2{310_A909}
msa236800.2{310_CJB110}
msa236800.2{310_H36B}
msa236800.2{310_JM9130013}
msa236800.2{310_COH1}
msa236800.2{310_M732}
msa236800.2{310_M781}
msa236800.2{310_1169NT}

Consensus 



151 

FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 
FASIDVSFIS 



LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 
LNLILPALKE 



ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA
ILVDGGQVVA



LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 
LIKPQFEAGR 



200 

EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 
EQIGKNGIVK 



********** ********** ********** ********** ********** 



msa236800.2{310_090}
msa236800.2{310_18RS21}
msa236800.2{310_2603}
msa236800.2{310_A909}
msa236800.2{310_CJB110}
msa236800.2{310_H36B}
msa236800.2{310_JM9130013}
msa236800.2{310_COH1}
msa236800.2{310_M732}
msa236800.2{310_M781}
msa236800.2{310_1169NT}
Consensus



201 

DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 
DKLVHEKVLT 



TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 
TVTNFTKDYG 



YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 
YTVKHLDFSP 



iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
iQGGHGNIEF 
vQGGHGNIEF 
vQGGHGNIEF 
vQGGHGNIEF 
iQGGHGNIEF 



250 

LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 
LMHLQKCQDP 



********** ********** ********** _********* ********** 



msa236800.2{310_090}
msa236800.2{310_18RS21}
msa236800.2{310_2603}
msa236800.2{310_A909}
msa236800.2{310_CJB110}
msa236800.2{310_H36B}
msa236800.2{310_JM9130013}
msa236800.2{310_COH1}
msa236800.2{310_M732}
msa236800.2{310_M781}
msa236800.2{310_1169NT}
Consensus



251 

QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 
QNLVLDQIQD 



VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 
VIEKAHKEFK 



********** ********** 



275 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 

KNEEE 
***** 
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Table 69: Comparative Sequences relating to SAG0032 



SEQ ID NO. 6901 
STRAIN 2603 

ATGAATAAAAAGGTACTATTGAC^TCGACAATGGCAGCTTCGCTATTATCAGTCGC^AGT 
GTTCAAGCACAAGAAACAGATACGACGTGGACAGCACGTA 

G ATTTGGTAAAG CAAGACAATAAAT CATCAT ATACTGTGAAAT ATGGTG ATACACTA^G C 
GTTATTT CAG AAGCAATGT CAATTGATATG AATGT CTTAG CAAAAAT A^TAA.CATTG CA 
GAT AT CAAT CTT ATTTAT CCTGAG ACAACACT GACAGTAACTTACGAT CAGAAGAGT CAT 
ACTGCCACTTCAATGAAAATAGAAACACCAGCAACAAATGCT 
ACTGTGGATTTGAAAACCAATCAAGTTTCTGTTGCAGACCAAA^ 

ATTT CGG AAGGTATGACAC CAGAAG CAG CAACAACGATTGTTT CG C CAATGAAG ACATAT 
T CTT CTG CG C CAGCTTTGAAAT CAAAAGAAGTATTAG CACAAGAG CAAG CTGTTAGT CAA 
G CAG CAG CTAATGAACAGGTAT CAC CAG CTCCTGTGAAGT CGATTACTT CAGAAGTT CCA 
G CAG CTAAAGAG GAAGTTAAAC CAACT CAG ACGT CAGT CAGT CAGT CAACAACAGTATCA 
CCAGCTTCTGTTGCCGCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCC4GTAAGAA 
GTAGCAGCCCCTAGAGTGGCA^GTGTTAAAGTAGTCACTCCTAAAGTAGAAA.CTGGTGCA 
TCACCAGAGCATGTATCAGCTCCAG 

AGTAAGTTACAAGCGACTGAAGTTAAGAGCGTTCCGGTAGCA.CA 

ACACCGGTAGCACAACCAGCTTCAACAACAAATGCAGTAGCT 

GGGCTCCAACCTCATGTTGCAGCrTATAAAGAA 

GAATT CAGTACATAC CGTG CGGGAGAT CCAGGTGATCAT GGTAAAGGTTTAG CAGTTGAC 
TTTATTGTAGGT ACTAAT CAAG CACTTGGTAATAAAGTT G CACAGTACTCTACACAAAAT 
ATGG CAG CAAATAACATTT CAT ATGTTAT CTGGCAACAAAA.GTTTTACTCAAATACAAAC 
AGTATTTATGGAC CTG CTAATACITGGAATG CAATGCCAGAT CGTGGTGGCGTTACTG C C 
AAC CACTATGACCACGTT CACGTAT CATTTAACAAATAATATAAAAAAGGAAG CTATTTG 
GCTTCITTTTTATATGCCTTGAATAGAC^ 



SEQ ID NO. 6902
STRAIN 090 

TGAGACAACACTGACAGTAACTTACGATCAGAAGAGTCATACT 
GAATGAAAATAGAAACACCAGCAACAAATGCT 

ACTGTGGATTTGAAAAC CAAT CAAGTTTCTGTTG CAGACC^AAAAGTTT C 
TCT CAATACAATTT CC4GAAGGTATGACACCAGAAGCAG CAACAACGATTG 
TTTCG C CAATGAAGACATATTCTTCTG CG C CAG CTTTGAAATCAAAA.GAA 
GTATT AG CACAAGAG CAAG CTGTTAGT CAAG CAG CAGCTAATGAACAGGT 
ATCAACAGCTCCTGTGAAGTCGATTACTTCAGAAG'TTCCAGCAGCTAAAG 
AGGAAGTTAAACCAACTCAGACGT CAGT CAGT CAGTCAACAACAGTATCA 
CCAGCTTCTGTTGCCGCTGAAACACCAGCTCCAGTAGCTAAAGTAGCAC 
GGTAAGAACTGTAGCAG CC C CTAGAGTGGCAAGTGTTAAAGTAGTCACT C 
CTAAAGT AGAAACTGGTGCAT CAC CAG AG CATGTAT CAG CTCCAG CAGTT 
C CTGTGACTACGACTTCAACAG CTACAGACA3TAAGTTACAAG CGACTGA 
AGTT AAGAG CGTT CCGGTAG CACAAZ\AAG CTC CAACAG CAACAC C GGTAG 
CACAACCAG CTT CAACAACAAATGCAGTAG CTGCACAT CCTGAAA^TGCA 
C4GGCTCCAACCTCATGTTGCAGCTTATA 

TGGAGTTAATGA^TTCAGTACATACCGTGC^GGTGATCCAGGTGATCATG 

GTAAAGGTTTAGCAGTCGACTTTATTGTAGGTAAAAACCAAGCACTTGGT 

AATGAAGTTGCACAGTACTCTACACAAAATATGGCAGCAAA 

AT ATGTTAT C TGGCAACAAAAGTTTTACT CAAATACAAATAGTATTTATG 

GACCTTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTACTGCC 

AACGATTATGACCATGTTCACGTATCATTTA^CAAATAATATAAAAAAGG 

AAGCTATTTGGCJ.T CTTTTTTATATGCCTTGAAT AGACTTT CAAGGTTCT 

TATATAATTTTTATTA 



SEQ ID NO. 6903
STRAIN A909 

CTGATTTGGTAAAG CAAGACAATAAATCAT CATATACTGTGAA 
ATATGGTGATACACTAAG CGTTATTTCAGAAG CAATGTCAATTGATATGA 
ATGTCTTAG CAAAAATTAATAACATTGCAGATATCAATCTTATTTATC cT 
GAGACAACACTG aCAGTAACITACG ATCAGAAGAGTCATACT C 
AATGAAAATAGAAACACCAG C^CAAATGCTG CTGGTCAAACAaCAG cT A 
CTGTCGATTTGAAAACCAATCAAGTTTCTGTTGCAGACCAAAA^ 
CTCAATACAATTTCGGAAG GTATGACACCAGAAG CAG CAACAACGATTGT 
TTCGCCAATGAAGACATATTCTTcTGCGCC^GCITTGAAATCAAAAGAAG 
TATTAg CACAAGGGC aAG CTGTTAGTCAAGCAGCAGCTAATGAACAGGTA 
TCAc CAG CT C CTGTG AAGTCGATTACIT CAGAAGTTC CAg CAG CTAAAGA 
GGAAGTTAAACCAaCTCAgACGTCAgTCAGTCAGTCAACAACAGTATCAC 
CAgCTTCTGTTGCCGCTGAA^CACCAGCTCCAgTAGCTA 
GTAAGAACTGTAGCAG C C CCTAGAGTGG CAAGTGTTAAAGTAGT CACTC C 
TAAAGTAGAAACIX3GTGCATCACCAGAGCATGTATCAGCTCCAGCAGTTC 
CTGTGACTACGACTTCAACAGCTACAGACAGTAAGTTACAA 
GTTAAGAG CGTTCCGGTAGCACAAAAAGCT C CAACAGCAACACCGGT AG C 
ACAACCAGCTTCAACAACAAATGCAGTAGCTGC^ 
GGCTCCAACCTCATGTTGCIAGCTTATAAAGAAAA 

GGAGTTAATGAATTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGG 
TAAAGGTTT AGCAGTTGACTTTATTGTAgGTAAAAAC CAAGCACTTGGTA 
ATGAAGTTG CACAGT ACT CTACACAAAAT ATGG CAGCaAATAACATTTCA 
TATGTTATCTGG CAACAAAAGTTTTACT CAAAT aCAAATAGTATTTATGG 
ACcTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTAcTGCCA 
ACCa CTATGAC CACGTT CACGT ATCATTTAACAAATaATATAAAAAAGGA 
AG CT aTTTGGCITCITTTTTATATG C CT^ CAAGGTT CTT 

ATATAATTTTTATTA 

SEQ ID NO. 6904
STRAIN H36B 
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CTGATTTGGTAAAGCAAGACAATAAATCATCATATAcTGTGAAATA 

TGGTGATACAcTAAGCGTTATTTCAGAAGCAATGTCaATTGATATGAATG 

TCTTAGCAAAAATTAATAACATTGCAGATATCAATCTTATTTATCcTGAG 

ACAACaCTGaCAGTAaCTTACX3ATCAGAAGAGTCATACTGCTACTTCAAT 

GAAAATAGAAACACCAGCAACAAATGCrGCTGGTCAAACAAC^ 

TCGATTTGAAAACCAATCAAGTTTCTG 

AATACAATTTCGGAAGGTATGACACCAGAAGCAGCAACAACGATTGTTTC 

GCCAATGAAGACATATTCTTCTGCGCCAGCTTTGAAATCAAAAGAAGTAT 

TAGCACAAGGGCAAG CTGTTAGTCAAGCAGCAG CTAATGAACAGGTATCA 

CCAG CT C CTGTG AAGTCGATTACTT CAGAAGTTC CAG CAG CTAAAGAGGA 

AGTTAAACCAACTCAGACGTCAGTCAGTCAGTCAACAACAGTATCACCAG 

CTTcTGTTGCCGCTGAAACACCAGCTCCAGTAGcTAAAGTAGCACCGGTA 

AGAACTGTAGCAGCCCcTAGAGTGGCAAGTGTTAAAGTAGTCACTCcTAA 

AGTAG AAACTGGTG CATCAC CAGAG CATGTAT CAG CT CCAGCAGTTC CTG 

TGACTACGACTT CAACAG CTACAGACAGT AAGTTACAAG CGACTG AAGTT 

AAGAGCGTTCCGGTAGCACAAAAAGCTCCAACAGCAAC^ 

ACCAG CTTCAACAACAAATG CAGTAGCTG CACAT CCTGAAAATG CAAGG C 

TCCAACCTCATGTTGCAG CT TATAAAG AAAAAGTAG CGT CAACTTATGGA 

GTTAATGAATTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGGTAA 

AGGTTTAGCAGTTGACTTTATTGTAGGTAAAAAC 

AAGTTGCACAGTACTCT ACACAAAA t aTGG CAGCAAATAACATTT CATAT 

GTTATCTGG CaACAAAAGTTTT ACT CAAAT ACAAATAGTATTTAT GGACC 

TG CTAATACTTGGAATG CAATG CCAgAT CGTGGTGGCGTTACTG CCAAC C 

ACTATGACGACGTTCACCTATGATTTAACAAA 

TATTTGGCTTCTTTTTTATATGCC^^ 

TAATTTTTATTA 

SEQ ID NO. 6905

STRAIN 18RS21 
CTGATTTGGTAAAG CAAGACAAT 

AAAT CAT CAT ATACTGTGAAATATGGTGAT ACAcTAAG C GTTATTTCAGA 
AGCAATGTCAATTGATATGAATGTCTTAGCAAAAaTAAATAACAT^ 
ATATCAATCTTATTTATCcTGAGAC^^CaCTGaCAGTAACTTACGATCAG 
AAGAGTCATACTGCCaCTTCAATGAAAATAGAAACACCAGCAaCAAATGC 
TGCTGGTCAaACAaCAGCTACTGTGGATTTGAAAACCAATC^ 
TTGCAGACCAAAAAGTTTCTCT CAATACAATTTCGGAAGGTATGACACCA 
GAAG CAG CAAC1AACGATTGTTT CG CCAATGAAGAC aTATTCTTcTGCGC C 
AG CTTTGAAaTCAAAAGAAGTATTAG CACAAGAG CAAG CTGTTAGTCAAG 
CAGCAiGCTAATGAACAGGTATCACCAG CT C CTGTGAAGTCGATTACTTCA 
GAAGTTCCAGCAGCTAAAGAGGAAGTTAAACCAACTCAGACGTCAGTCAG 
TCAGTCAACAACAGT AT CACCAG CTTCTGTTG CCGCTGAAACAC CAG CTC 
CAGTAG CTAAAGTAG CACCGGTAAGAACTGTAGCAGC CCCTAGAGTGGCA 
AGTGTTAAAGTAGT CACTCCTAAAGTAGAAACTGGTG CAT CACCAGAG CA 
TGTATCAGCTCCAGCAGTTCCTGTGACTACGACI^ 

GT AAGTTACAAG CG ACTGAAGTTAAGAG CGTT CCGGTAG CACAAAAAG CT 
CCAACAGCAACACCGGTAGCACAACCAGCTTCAACAAC^AATGCAGTAGC 
TG CACAT CCTGAAAATGCAGGG CT C CAAC CTCATGTTG CAGCTTATAAAG 
AAAAAGTAGCGTC^CTTATGG^GTTAATGAATTCAGTACATACCGTGCG 
GGAGATCCAGGTGATCATGGTAAAGGTTTAGCAGTTGACTTTATTGTAGG 
TACTAAT CAAG CACTTGGTAATAAAGTTG CACAGTACT cT ACACAAAATA 
TGGCAGCAAATAACATTTCATATGTTATCTG^ 

AATACAAACAGTATTTATGGAC CTGCTAATACTTGGAATG CAATGCCAGA 
TCGTGGTGGCGTTACTGCCAACCACTATGACCACGTTCACGTATCATTTA 
ACAAATAATATAAAAAAGGAAGCTATTTGGCTTCTTTTTTAT 
AATAGACTTTCAAGGTTCTTATATAATTTTTATT'A 

SEQ ID NO. 6906

STRAIN COH1 
CTGATTT 

GGTAAAG CAAGACAATAAAT CATCATATACTGTGAAATATGGTGATACAC 
TAAG CGTTATTTCAGAAG CAATGT CAATTGATATGAATGTCTT AG CAAAA 
ATTAATAACATTGCAGATATCAATCTTATTTATCCTGAGACAACACTG^ 
AGTAACTTACGATCAGAAGAGT CATACTG CCACTT CAATGAAAATAGAAA 
CACCAGCAACAAATGCTGCTGGTCAAACAACAGcTACTGT 
ACCAATGAAGTTTTTGTTGC^GACCAAAAAGTT^ 

GGAAGGTATGACACCAGaaGCAGCAACAACGATTGTTTCGCCAATGAAGA 
CaTATTCTTCTGCGCCAGCTTTGAAATCAAA 

CAAG CTGTT AGTCAAGT AG CAG CTAATGAACAGGTATCACCAGCT CCTGT 

GAAGT CGATTACrTCAGAAGTTCCAGCAG CTAAAGAGGAAGTTAAAC CAA 

CTCAGACGTCAGT CAGT CAGTTAACAAC^GTATCACCAGCTTCTGTTGCC 

GCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAACTGTAGC 

AGCCCCTAGAGTGGCAAGTGcTAAAGTAGTCACTCgTAAAGTAGAAACTG 

GTGCATCAC CAGAG C^TGTATCAGCTCCAGCAGTT CCTGTGACTACGACT 

TCACCAGCTA(^GAC^GTAAGTTACAAGCGACTGAAGTTAAGAGCGTTCC 

GGTAGCACAAAAAGCTCCAACAGCAACACCGGTAGCAC^ 

CAACAAATG CAGTAG CTGGACATCCTGAAAATGCAGGGCT 

GTTGCAG CIT AT AAAGAAAAAGTAG CGT CAACTTATGGAGTTAAT 

C^GTACATACCXSTGCGGGAGATCCAGGTGATCATGGTAAAGGTTTAGCAG 

TTGACTTTATTGTAGGT AAAAACCAAG CACTTGGTAATGAAGT^ 

Ta CTCTACACAAAATATGG CAGCAAATAACATTT CATATGTTAT CTGGCA 

ACAAAAGTTTTATT CAAATACAAAT AGTATTTATGGACCTGCTAATACTT 

GGAATGCAATGCCAGATCGTGGTGGCGTTA(nX3CCAACC^CTATGACCAC 

GTTCACGTAT CATTT AACAAATAAT ATAAAAAAGGAAG CTATTTGG CTT C 



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TTTTTTATATGCCTTGAATAGACTTTCAAGGTTCTTATATAATTTTTATT 
A 

SEQ ID NO. 6907 
STRAIN M732 

CTGATTTGGTAAAG CAAGACAATAAAT CAT CATAT ACTGTGAAATATGGT 

GATACAnTAAG CGTT ATTT CAGAAG CAATGT CAATTG ATATG AATGT CTT 

AG CAAAAATT A^TAACATTG CAGAT AT CAAT CTTATTTAT C CTG AGACAA 

CACTGACAGTAACTT ACGAT CAGAAGAGT CA t ACTG CCACTT CAATG AAA 

ATAGAAACAC CAGCAACAAATG CTG CTGGT CAAACAACAG CTACTGT C G A 

TTTGAAAACCAATCAAGTTTTTGTTGCAGACCAAAAAGTTTCTCTCAATA 

CAATTTCGGAAGGTATGACACCAGAAGCAGCAACAACGATTGTTTCGCCA 

ATGAAGACATATTCTTCTGCGCCAGCTTTGAAATCAAAAGAAGTATTAGC 

ACAAGAG CAAG CTGTTAGT C AAGTAG CAG CTAATG AACAGGTAT CAC CAG 

CTCCTGTGAAGTCGATTACTTCAGAAGTTCCAGCAGCTAAAGAGGAAGTT 

AAACCAACTCAGACGTCAGTCAGTCAGTTAACAACAGTATCACCAGCTTC 

TGTTGCCGCTGAAACACCAGCTCCAGTAGCTAAAGTAGCACCGGTAAGAA 

CTGTAGCAGCCCCTAGAGTGGO^GTGCTAAAGTAGTCACTCCTAAAGTA 

GAAACTGGTGCATCACCAGA.GCATGTATCA,GCTCCAGCAGTTCCTGTGAC 

TACGACTT CACCAG CT ACAG ACAGT AAGTT ACAAG CG ACTGAAGTTAAG A 

GCGTTCCGGTAGCACAAAAAGCTCCAAC!AGCAaCACCGGTAGCACA^ 

GCTTCAACAACAAATGCAGTAGCTGCACATCCT 

AC CT CATGTTGCAG CTTATAAAGAAAAAGTAG CGT CAACTTATGGAGTTA 
ATGAATT CAGTACATAC CGTG CGGG AGAT C CAGGTGATCATGGTAAAGGT 
TTAGCAGTTGACTTTA t tgtaggtaaaaac cAAG CACTTGGTAATGAAGT 
TGCACAGTACTcTACACAAAATATGGCAGCAAATAACATT^ 
TCTGGGAACAAAAGTTTTATTCAAATAC^ 

AATACTTGGAATGCAATGCCAGATCGTGGTGGCX3TTACTGCCAACCACTA 
TGACCACGTTCACGTAT CATTTAACAAATAATATAAAAAAGGAAG CTATT 
TGGCTTCTTTTTTATATGCCTTGAATAGACTITTCAAGGTT CTTATATAAT 
TTTTATTA 

SEQ ID NO. 6908
STRAIN M781

CTGATTTGGTAAAG CAAGACAATAAATCATCATATACTGTGAAATATGGT 
GATACACTAAGCGTTATTT CAGAAGCAATGTCAATTG ATATG AATGT CTT 
AG CAAAAATTAATAACATTG CAGATAT CAATCTTATTTATCCTGAGACAA 
CACTGACAGTAACTTACGATCAGAAGAGTCATACrGC CAC^ CAATGAAA 
ATAGAAACACCAGCAACAAATGCTGCTGGT CAAACAACAGCTACTGTCGA 
TTTGAAAACCAATCAAGTTTTTGTTGCAGACCLAAA 

CAATTTCGGAAGGTATGACACCAGAAG CAG CAACAACGATTGTTTCGC CA 
ATGAAGACATATT CTT CTG CGC CAG CTTTGAAAT CAAAAGAAGTATTAG C 
ACAAGAG CAAGCTGTTAGT CAAGTAG CAG CTAATGAACAGGTAT CAC CAG 
CT CCTGTGAAGTCGATTACTTCAG AAGTT C CAGCAG CTAAAGAGGAAGTT 
AAAC CAACT CAGACGTCAGTCAGT CAGTTAACAACAGTAT CACCAG CTT C 
TGTTGCCGCTGAAACACCAG CTCCAGTAGCTAAAGTAGCACCGGTAAGAA 
CTGTAGCAG C CCCTAGAGTGGCAAGTG CTAAAGTAGTCACTCCTAAAGTA 
GAAACTGGTG CATCAC CAGAGCATGTATCAGCTC CAG CAGTT CCTGTGAC 
TACGACTTCACCAG CTACAGACAGTa aGTTACAAGCGACTGAAGTTAAGA 
GCGTTCCGGTAGCACAAAAAGCTCCAACAGCAACACC 
GCTTCAACAACAAATG CAGTAG CTGCACATCCTGAAAATG CAGGG CT CCA 
AC CTCATGTTG CAG CTTATAAAGAAAAAGTAG CGT CAACTTATGGAGTTA 
ATGAATTCA.GTACATACCGTGCGGGAGATCCAGGTGATCATGGTAAAGGT 
TTAGCAGTTGACTTTATTGTAGGTAAAAAC CAAG CACTTGGTAATGAAGT 
TG CACAGTACTCTACACAAA^TATGGCAG GAAATAACATTTCATATGTTA 
TCTGGCAAGAAAAGTTTTATTCAAATACAAA 

AATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTA 
TGACCACGTT CACGTAT CATTTAACAAATAATATAAAAAAGGAAG CT aTT 
TGGCTTCTTTTTTATATGCCTTGAATAgACTTTCAAGGTTCTTATATAAT 
TTTTATTA 

SEQ ID NO. 6909
STRAIN CJB110

CTGATTTGGTAAAG CAAGACAATAAAT CAT CATAT ACTGTGAAA 

TATGGTGATACACT AAG CGTTATTT CAGAAGCAATGT CAATTGATATGAA 

TGTCTTAGCAAAAATTAAT AACATTGCAGATAT CAAT CTT ATTT ATC CTG 

AGACAACACTGACAGTAACTTACGATCAGAAGAGTCATACT 

ATGAAAATAGAAACACCAGCAACAAATGCTGCTCGTCAAACACCAGCTAC 

TGTCGATTTGAAAACCAATCAAGTTTcTX3TTGCAGACCA^ 

TCAATACAATTTCGGAAGGTATGACACCAGAAGCAGCAAC^ 

T CGCCAATG AAGACATATTCITCTG CG CCAGCTT^ CAAAAGAAGT 

ATTAG CACAAGAG CAAGCTGTTAGT CAAG CAG CAG CTAATGAACAGGTAT 

CAACAGCTCCTX3TG AAGTCGATT ACTT CAGAAGTTCCAG CAG CTAAAGAG 

GAAGTTAAACCAACTCAGACGTCAGTCAGTCAGTCAACAACA 

AgCTTCTGTTG C CG CTG AAACACCAG CT C CAGTAG CT AAAGT AGCAC CGG 

TAAgAACTGTAG CAG CC CCTAG AGTGG CAAGTGTTAAAGTAGTCACT CCT 

AAAGTAGAAACTGGTGCATCAC CAGAG CATGTAT CAG CT C CAG CAGTTC C 

TGTGACTACGACTTCAACAGcTACAGACAGTaAGTTaCAAGCGACTGAAG 

TTAAGAGCGTTCCGGTAGCACAAAAAGCTCCAACAGCAACACCGGTAGCA 

CAACC^GCITCAACAACAAATGCAGTAGCrGCA(ZATCCT 

GCTCCAACCT CATGTTG CAG CTTAT aAAGAAAAAGTAGCGTCAACTTATG 

GAGTTAATG AATTCAGT ACATa C CGTG CAGGTGAT C CAgGTGAT CATGGT 

AAAGGTTTAGCAGTcGACITTATTGTAgGTAAAAACCAAGCACTTGGTA^ 
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TG AAGTTG CACAGTACT CT ACACAAAATATGG CAG CAAAT AACATTT CAT 
ATGTTATCTGGCAACAAAAGTTTTACTCAAATACAAATAGTATTTATGGA 
CCTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAA 
CCATTATGAC CATGTT CACGTAT CATTTAACAAATAATATAAAAAAGGAA 
GCTATTTGG CTT CTTTTTT ATATG C CTTG AATAG AC t TTCAAGGTT CTT A 
TATAATTTTTATTA 

SEQ ID NO. 6910 
STRAIN 1169NT 
CTGATTTG 

GTAAAGCMGACAATAAATCATCATATACTGTGAAATATGGTGATACACT 
AAGCGTTATTTCAGAAGCAATGTCAATTGATATGAATGTCTTAGCAAAAA 
TTAATAACATTG CAG AT AT CAATCTT ATTTAT C C TGAGACAACACTGACA 
GTAACTTACG AT CAg AAGAGTCAT ACTG C CACTTCAATGAAAATAGAAAC 
ACCAGCAACAAATG CTGCTGGTCAAACAAGAGCTACTGTGGATTTGAAAA 
CCAATCAAGTTTCTGTTGCAGACGAAAAAGTTTCTCT 
GAAGGTATGACACCAGAAG CAg CAACAACGATTGTTTCGC CAATGAAGAC 
ATATTCTTCTGCGC CAG CTTTg AAAT CAAAAGAAGTATTAGCACAAG AG C 
AAGCTGTTAGTCAAGC^GCAGCrAATGAAC^GGTATCACCAGCrCCrGTG 
AAGTCGATTACTTCAgAAGTT C CAg CAGCTAAAGAGG AAGTTAGAC CAa C 
TcAGACGT(^GTCAGTCAGTC^CMCAGTATCACCAgCTTCTGTTGCCG 
CTGAAACAC CAG CT C CAGT AG CTAAAGTAG CAC CGGT AAGAACTGTAG CA 
GCCCCAGCCCCrrAGAGTGGCAAGTGCTAAAGTAGTCACTCCTAAAGTAGA 
AACTGGTGCATCACCAGAGCATGTACCAGCTCCAGCAGTTCCTGTGACTA 
cGACTTCAACAG CTAC aGACAaTaAGTTACAAG CGACTGAAGTTAAgAG C 
GtTCCGGTgGCAGAAAAAGCTCCAACAGCAACACCGGTaGC^ 
TT cAACAACAAATGCAGTAGcTGCACATCCTGAAAATGCAGGACTCCAAC 
CTCATGTTG CAGCTTAT AAAGAAAAAGTAG CX3TCAACTTATGGAGTTAAT 
GAATTCAGTACATaC CGTG CGGGAGATCCAGGTG ATCATGGT AAAGGTTT 
AG CAGTTGACTTTATTGTagGTAAAAACCAAG CACTTGGTAATGAAGTTG 
CACAGTACTCTACACAAAATATGGGAGCAAAT^ CATATGTTAT C 

TGGCAACAAAAGTTTTACTCAAATACAAATAGTATTTATG 
TACTTGGAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTATG 
AC CACGTTCACGTAT CATTTAACAAAT AATATAAAAAAGGAAGCTATTTG 
G CTTCTTTTTTATATG C CTTGAAT AGACTTT CAAGG t TCTTATATAATTT 
TTATTA 

SEQ ID NO. 6911
STRAIN JM9130013

CTGATTTGGTAAAG CAAGACAATAAAT CAT CATATACT 
GTGAAATATGGTGATACACTAAGCGTTATTTCAGAAGCAATGTC7UVTTGA 
TATGAATGT CTTAG CAAAAATAAATAACATTG CAGAT ATCAAT CTTATTT 
AT CcTGAGACAACACTGACAGTAACTTACGAT CAGAAGAGTCATACTGC C 
ACTTCAATGAAAATAGAAACAC CAG CAACAAATG CTG CTGGT CAAACAAC 
AGCTACTGT GGATTTGAAAACCAAT CAAGTTT CTGTTG CAGACCAAAAAG 
TTTCTCTCAATACAATTTCGGAAGGTATGACACCAGAAGCAGC^ 
ATTGTTT CG CCAATG AAGACATATT CTT CTG CG C CAG CTTTGAAATCAAA 
AGAAGTATTAGCACAAGAGCAAGCTGTTAGTCAAGCAGCAGCTAATGAAC 
AGGTATCACCAG CT C CTGTGAAGTCGACTACTTCAGAAGTTCCAGCAG CT 
AAAGAGGAAGTTAAACCMCTCAGACGTC^GTCAGTCAGTCAACAACAGT 
ATCAC CAg CTTCTGTTGCCGCTGAAACAC CAGCT CCAGTAGCTAAAGTAG 
CACCGGTAAGAACTGTAGCAG C C CCTAgAGTGGCAAGTGTTAAAGTAGTC 
ACT C CTAAAGTAGAAACTGGTG CAT CACC AGAG CATGTATCAG CT CCAG C 
AGTT C CTGTGACTACGACTT CACCAG CTACAG aCAGT AAGTTACAAG CGA 
cTGAAGTTAAGAGCGTTCCGGTAG CACAAAAAG CTCCAACAG CAACACCG 
GTAGCaCAACCAGCTTCAACAACAAATGCAGTAGCTGC^ 
TGCAGGG CT CCAACCT CATGTTGCAGCTTATAAAG AAAAAGTAG CGT CAA 
CTTATGGAGTTAATGAATTCAGTACATACCGTGCGGGAGATCCAgGTGAT 
CATGGTAAAGGTTTAG CAGTTGACTTTATTGTAGGTACTAATCAAG CACT 
TGGTAATAAAGTTG CACAGTACTCTACACAAAATATGG CAGCAAATAACA 
TTT CATATGTTATCTGG CAACAAAAGTTTTACT CAAATACAAACAGTATT 
TATGGACCTGCTAATACTTGGAATGCAATGCCAGATCGTGGTGGCGTTAC 
TG CCAACCACTATG AC CACGTT CACGTAT CATTTAACAAATAATATAAAA 
AAGGAAG CTATTTGGCTT CTTTTTTATATG CCTTG AATAGACT^ CAAGG 
TT CTT AT AT AATTTTTATTA 

PRETTY of: /biotmp/msal67919.2{*}  March 11, 2003 08:55  ..

                            1                                                   50
msal67919.2{322_COH1}
msal67919.2{322_M781}
msal67919.2{322_M732}
msal67919.2{322_18RS21}
msal67919.2{322_2603}       atgaataaaa aggtactatt gacatcgaca atggcagctt cgctattatc
msal67919.2{322_JM9130013}
msal67919.2{322_090}
msal67919.2{322_CJB110}
msal67919.2{322_A909}
msal67919.2{322_H36B}
msal67919.2{322_1169NT}
Consensus                   ********** ********** ********** ********** **********

51 100 






WO 2004/018646 



PCT/US2003/026827 



Table 69: Comparative Sequences relating to SAG0032 



msal67919.2{322_COH1}
msal67919.2{322_M781}
msal67919.2{322_M732}
msal67919.2{322_18RS21}
msal67919.2{322_2603}       agtcgcaagt gttcaagcac aagaaacaga tacgacgtgg acagcacgta
msal67919.2{322_JM9130013}
msal67919.2{322_090}
msal67919.2{322_CJB110}
msal67919.2{322_A909}
msal67919.2{322_H36B}
msal67919.2{322_1169NT}
Consensus                   ********** ********** ********** ********** **********



                            101                                                150
msal67919.2{322_COH1}                          ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_M781}                          ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_M732}                          ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_18RS21}                        ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_2603}       ctgtttcaga ggtaaaggct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_JM9130013}                     ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_090}
msal67919.2{322_CJB110}                        ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_A909}                          ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_H36B}                          ct gatttggtaa agcaagacaa taaatcatca
msal67919.2{322_1169NT}                        ct gatttggtaa agcaagacaa taaatcatca
Consensus                   ********** ********



                            151                                                200
msal67919.2{322_COH1}       tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_M781}       tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_M732}       tatactgtga aatatggtga tacantaagc gttatttcag aagcaatgtc
msal67919.2{322_18RS21}     tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_2603}       tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_JM9130013}  tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_090}
msal67919.2{322_CJB110}     tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_A909}       tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_H36B}       tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
msal67919.2{322_1169NT}     tatactgtga aatatggtga tacactaagc gttatttcag aagcaatgtc
Consensus



                            201                                                250
msal67919.2{322_COH1}       aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
msal67919.2{322_M781}       aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
msal67919.2{322_M732}       aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
msal67919.2{322_18RS21}     aattgatatg aatgtcttag caaaaataaa taacattgca gatatcaatc
msal67919.2{322_2603}       aattgatatg aatgtcttag caaaaataaa taacattgca gatatcaatc
msal67919.2{322_JM9130013}  aattgatatg aatgtcttag caaaaataaa taacattgca gatatcaatc
msal67919.2{322_090}
msal67919.2{322_CJB110}     aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
msal67919.2{322_A909}       aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
msal67919.2{322_H36B}       aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
msal67919.2{322_1169NT}     aattgatatg aatgtcttag caaaaattaa taacattgca gatatcaatc
Consensus



                            251                                                300
msal67919.2{322_COH1}       ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_M781}       ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_M732}       ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_18RS21}     ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_2603}       ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_JM9130013}  ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_090}                   TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_CJB110}     ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_A909}       ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_H36B}       ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
msal67919.2{322_1169NT}     ttatttatcc TGAGACAACA CTGACAGTAA CTTACGATCA GAAGAGTCAT
Consensus                              ********** ********** ********** **********



                            301                                                350
msal67919.2{322_COH1}       ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_M781}       ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_M732}       ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_18RS21}     ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_2603}       ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_JM9130013}  ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_090}        ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_CJB110}     ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_A909}       ACTGCtACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_H36B}       ACTGCtACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
msal67919.2{322_1169NT}     ACTGCcACTT CAATGAAAAT AGAAACACCA GCAACAAATG CTGCTGGTCA
Consensus                   *****_**** ********** ********** ********** **********






                            351                                                400
msal67919.2{322_COH1}       AACAaCAGCT ACTGTcGATT TGAAAACCAA TCAAGTTTtT GTTGCAGACC
msal67919.2{322_M781}       AACAaCAGCT ACTGTcGATT TGAAAACCAA TCAAGTTTtT GTTGCAGACC
msal67919.2{322_M732}       AACAaCAGCT ACTGTcGATT TGAAAACCAA TCAAGTTTtT GTTGCAGACC
msal67919.2{322_18RS21}     AACAaCAGCT ACTGTgGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_2603}       AACAaCAGCT ACTGTgGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_JM9130013}  AACAaCAGCT ACTGTgGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_090}        AACAcCAGCT ACTGTgGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_CJB110}     AACAcCAGCT ACTGTgGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_A909}       AACAaCAGCT ACTGTcGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_H36B}       AACAaCAGCT ACTGTcGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
msal67919.2{322_1169NT}     AACAaCAGCT ACTGTgGATT TGAAAACCAA TCAAGTTTcT GTTGCAGACC
Consensus                   ****_***** *****_**** ********** ********_* **********



                            401                                                450
msal67919.2{322_COH1}       AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_M781}       AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_M732}       AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_18RS21}     AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_2603}       AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_JM9130013}  AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_090}        AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_CJB110}     AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_A909}       AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_H36B}       AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
msal67919.2{322_1169NT}     AAAAAGTTTC TCTCAATACA ATTTCGGAAG GTATGACACC AGAAGCAGCA
Consensus                   ********** ********** ********** ********** **********



                            451                                                500
msal67919.2{322_COH1}       ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_M781}       ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_M732}       ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_18RS21}     ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_2603}       ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_JM9130013}  ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_090}        ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_CJB110}     ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_A909}       ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_H36B}       ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
msal67919.2{322_1169NT}     ACAACGATTG TTTCGCCAAT GAAGACATAT TCTTCTGCGC CAGCTTTGAA
Consensus                   ********** ********** ********** ********** **********



                            501                                                550
msal67919.2{322_COH1}       ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GtAGCAGCTA
msal67919.2{322_M781}       ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GtAGCAGCTA
msal67919.2{322_M732}       ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GtAGCAGCTA
msal67919.2{322_18RS21}     ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_2603}       ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_JM9130013}  ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_090}        ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_CJB110}     ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_A909}       ATCAAAAGAA GTATTAGCAC AAGgGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_H36B}       ATCAAAAGAA GTATTAGCAC AAGgGCAAGC TGTTAGTCAA GcAGCAGCTA
msal67919.2{322_1169NT}     ATCAAAAGAA GTATTAGCAC AAGaGCAAGC TGTTAGTCAA GcAGCAGCTA
Consensus                   ********** ********** ***_****** ********** *_********



                            551                                                600
msal67919.2{322_COH1}       ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_M781}       ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_M732}       ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_18RS21}     ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_2603}       ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_JM9130013}  ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_090}        ATGAACAGGT ATCAaCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_CJB110}     ATGAACAGGT ATCAaCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_A909}       ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_H36B}       ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
msal67919.2{322_1169NT}     ATGAACAGGT ATCAcCAGCT CCTGTGAAGT CGATTACTTC AGAAGTTCCA
Consensus                   ********** ****_***** ********** ********** **********



                            601                                                650
msal67919.2{322_COH1}       GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTtAAC
msal67919.2{322_M781}       GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTtAAC
msal67919.2{322_M732}       GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTtAAC
msal67919.2{322_18RS21}     GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_2603}       GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_JM9130013}  GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_090}        GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_CJB110}     GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_A909}       GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_H36B}       GCAGCTAAAG AGGAAGTTAa ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
msal67919.2{322_1169NT}     GCAGCTAAAG AGGAAGTTAg ACCAACTCAG ACGTCAGTCA GTCAGTcAAC
Consensus                   ********** *********_ ********** ********** ******_***






                            651                                                700
msal67919.2{322_COH1}       AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_M781}       AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_M732}       AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_18RS21}     AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_2603}       AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_JM9130013}  AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_090}        AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_CJB110}     AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_A909}       AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_H36B}       AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
msal67919.2{322_1169NT}     AACAGTATCA CCAGCTTCTG TTGCCGCTGA AACACCAGCT CCAGTAGCTA
Consensus                   ********** ********** ********** ********** **********



                            701                                                750
msal67919.2{322_COH1}       AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_M781}       AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_M732}       AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_18RS21}     AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_2603}       AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_JM9130013}  AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_090}        AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_CJB110}     AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_A909}       AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_H36B}       AAGTAGCACC GGTAAGAACT GTAG       CAGCCCCTAG AGTGGCAAGT
msal67919.2{322_1169NT}     AAGTAGCACC GGTAAGAACT GTAGcagccc CAGCCCCTAG AGTGGCAAGT
Consensus                   ********** ********** ****       ********** **********



                            751                                                800
msal67919.2{322_COH1}       GcTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_M781}       GcTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_M732}       GcTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_18RS21}     GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_2603}       GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_JM9130013}  GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_090}        GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_CJB110}     GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_A909}       GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_H36B}       GtTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
msal67919.2{322_1169NT}     GcTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT
Consensus                   *_******** ********** ********** ********** **********

                            801                                                850
msal67919.2{322_COH1}       AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAcCAGCT ACAGACAgTA
msal67919.2{322_M781}       AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAcCAGCT ACAGACAgTA
msal67919.2{322_M732}       AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAcCAGCT ACAGACAgTA
msal67919.2{322_18RS21}     AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAcCAGCT ACAGACAgTA
msal67919.2{322_2603}       AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAcCAGCT ACAGACAgTA
msal67919.2{322_JM9130013}  AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAcCAGCT ACAGACAgTA
msal67919.2{322_090}        AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAaCAGCT ACAGACAgTA
msal67919.2{322_CJB110}     AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAaCAGCT ACAGACAgTA
msal67919.2{322_A909}       AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAaCAGCT ACAGACAgTA
msal67919.2{322_H36B}       AtCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAaCAGCT ACAGACAgTA
msal67919.2{322_1169NT}     AcCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAaCAGCT ACAGACAaTA
Consensus                   *_******** ********** ********** ****_***** *******_**



                            851                                                900
msal67919.2{322_COH1}       AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_M781}       AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_M732}       AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_18RS21}     AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_2603}       AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_JM9130013}  AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_090}        AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_CJB110}     AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_A909}       AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_H36B}       AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTaGCACA AAAAGCTCCA
msal67919.2{322_1169NT}     AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTgGCACA AAAAGCTCCA
Consensus                   ********** ********** ********** ****_***** **********

                            901                                                950
msal67919.2{322_COH1}       ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_M781}       ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_M732}       ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_18RS21}     ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_2603}       ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_JM9130013}  ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_090}        ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_CJB110}     ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_A909}       ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_H36B}       ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
msal67919.2{322_1169NT}     ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC
Consensus                   ********** ********** ********** ********** **********



                            951                                               1000
msal67919.2{322_COH1}       ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_M781}       ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_M732}       ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_18RS21}     ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_2603}       ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_JM9130013}  ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_090}        ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_CJB110}     ACATCCTGAA AATGCAgGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_A909}       ACATCCTGAA AATGCAaGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_H36B}       ACATCCTGAA AATGCAaGgC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
msal67919.2{322_1169NT}     ACATCCTGAA AATGCAgGaC TCCAACCTCA TGTTGCAGCT TATAAAGAAA
Consensus                   ********** ******_*_* ********** ********** **********



                            1001                                              1050
msal67919.2{322_COH1}       AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_M781}       AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_M732}       AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_18RS21}     AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_2603}       AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_JM9130013}  AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_090}        AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCaGGt
msal67919.2{322_CJB110}     AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCaGGt
msal67919.2{322_A909}       AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_H36B}       AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
msal67919.2{322_1169NT}     AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCgGGa
Consensus                   ********** ********** ********** ********** ******_**_



                            1051                                              1100
msal67919.2{322_COH1}       GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAa
msal67919.2{322_M781}       GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAa
msal67919.2{322_M732}       GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAa
msal67919.2{322_18RS21}     GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAc
msal67919.2{322_2603}       GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAc
msal67919.2{322_JM9130013}  GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAc
msal67919.2{322_090}        GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTcGACTTTA TTGTAGGTAa
msal67919.2{322_CJB110}     GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTcGACTTTA TTGTAGGTAa
msal67919.2{322_A909}       GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAa
msal67919.2{322_H36B}       GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAa
msal67919.2{322_1169NT}     GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTtGACTTTA TTGTAGGTAa
Consensus                   ********** ********** ********** **_******* *********_



                            1101                                              1150
msal67919.2{322_COH1}       aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_M781}       aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_M732}       aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_18RS21}     tAAtCAAGCA CTTGGTAATa AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_2603}       tAAtCAAGCA CTTGGTAATa AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_JM9130013}  tAAtCAAGCA CTTGGTAATa AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_090}        aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_CJB110}     aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_A909}       aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_H36B}       aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
msal67919.2{322_1169NT}     aAAcCAAGCA CTTGGTAATg AAGTTGCACA GTACTCTACA CAAAATATGG
Consensus                   _**_****** *********_ ********** ********** **********



                            1151                                              1200
msal67919.2{322_COH1}       CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAtTCAAAT
msal67919.2{322_M781}       CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAtTCAAAT
msal67919.2{322_M732}       CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAtTCAAAT
msal67919.2{322_18RS21}     CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_2603}       CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_JM9130013}  CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_090}        CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_CJB110}     CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_A909}       CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_H36B}       CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
msal67919.2{322_1169NT}     CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTAcTCAAAT
Consensus                   ********** ********** ********** ********** ***_******



                            1201                                              1250
msal67919.2{322_COH1}       ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_M781}       ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_M732}       ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_18RS21}     ACAAAcAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_2603}       ACAAAcAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_JM9130013}  ACAAAcAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_090}        ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_CJB110}     ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_A909}       ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_H36B}       ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
msal67919.2{322_1169NT}     ACAAAtAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG
Consensus                   *****_**** ********** ********** ********** **********



                            1251                                              1300
msal67919.2{322_COH1}       TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_M781}       TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_M732}       TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_18RS21}     TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_2603}       TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_JM9130013}  TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_090}        TGGTGGCGTT ACTGCCAACC AtTATGACCA tGTTCACGTA TCATTTAACA
msal67919.2{322_CJB110}     TGGTGGCGTT ACTGCCAACC AtTATGACCA tGTTCACGTA TCATTTAACA
msal67919.2{322_A909}       TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_H36B}       TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
msal67919.2{322_1169NT}     TGGTGGCGTT ACTGCCAACC AcTATGACCA cGTTCACGTA TCATTTAACA
Consensus                   ********** ********** *_******** _********* **********



                            1301                                              1350
msal67919.2{322_COH1}       AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_M781}       AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_M732}       AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_18RS21}     AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_2603}       AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_JM9130013}  AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_090}        AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_CJB110}     AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
msal67919.2{322_A909}       AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGcAT
msal67919.2{322_H36B}       AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGcAT
msal67919.2{322_1169NT}     AATAATATAA AAAAGGAAGC TATTTGGCTT CTTTTTTATA TGCCTTGaAT
Consensus                   ********** ********** ********** ********** *******_**



                            1351                           1382
msal67919.2{322_COH1}       AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_M781}       AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_M732}       AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_18RS21}     AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_2603}       AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_JM9130013}  AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_090}        AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_CJB110}     AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_A909}       AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_H36B}       AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
msal67919.2{322_1169NT}     AGACTTTCAA GGTTCTTATA TAATTTTTAT TA
Consensus                   ********** ********** ********** **



SEQ ID NO. 6912 
STRAIN 2603 frame: 1 

MNKKVTiLTSTMAAS LLS VAS VQAQETDTTWTARTVSEVKADLVKQDNKS S YTVKYGD^ S 
VI SEAMS IDMNVLAKINNI ADI NL I YPETTLTVT YDQKSHTATSMKI ETPATNAAGQTTA 
TVDLKTNQVSVADQKVSLNT I SEGMTPEAATT I VS PMKTYSSAPALKSKEVLAQEQAVS Q 
AAANEQVSPAP VKS I TSEVPAAKEE VKPTQTS VS QSTTVS PASVAAETPAPVAKVAPVRT 
VAAPRVASVKVV^PKVETGASPEHVSAPAVPVTTTSPATDSKLQATEVKSVPVAQKAPTA 
TPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVNEFSTYRAGDPGDHGKGLxAVD 
FI VGTNQALGNKVAQ YSTQNMAANN I S YV I WQQKF YSNTNS I YGPANTWNAMPDRGGVTA 
NHYDHVHVSFNK. YKKGSYLASFLYALNRLSRFLYNFY 

SEQ ID NO. 6913 
STRAIN 090 frame: 2 

ETTLTVT YDQKSHTATSMKI ETPATNAAGQTPATVDLKTNQVS VADQKVS LNT I S EGMT P 
EAATT I VSPMKT YS S APAIiKSKEVLAQEQAVSQAAANEQVSTAP VKS I TSEVPAAKEEVK 
PTQTSVSQSTTVS PASVAAETPAPVAKVAP VRTVAAPRVAS VKWTPKVETGAS PEHVSA 
PAVPVTTTSTATDSKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVA 
AYKEKVAST YGVNE FSTYRAGDPGDHGKGLAVDF I VGKNQALGNE VAQYSTQNMAANN I S 
YVI WQQKFYSNTNS I YGPANTWNAMPDRGGVTANHYDHVHVSFNK . YKKGS YLAS FL YAL 
NRLSRFLYNFY 



SEQ ID NO. 6914

STRAIN A909 frame: 3 

DLVKQDNKS S YTVKYGDTL S VI SEAMS IDMNVIiAKINNIADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGC/ITATVDLKTNQVS VADQKVS LNT I SEGMTPEAATT I VS PMKT Y 
SSAPALKSKEVLAQGGAVSQAAANEQVSPAPVKSITSEVPAAKEEVKPTGTSVSQSTTVS 
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSTATD 
S KLQATE VKS VP VAQKA PTAT P VAQ P A STTNAVAAH P ENARLQPHVAAY KE KVAS TYGVN 
E FSTYRAGDPGDHGKGLAVDF I VGKNQALGNEVAQYSTQNMAANNIS YVI WQQKFYSNTN 
SI YGPANTWNAMPDRGGVTANHYDHVHVSFNK . YKKGS YLAS FLYALHRLSRFLYNFY 

SEQ ID NO. 6915 
STRAIN H36B frame: 3 

DLVKQDNKS S YTVKYGDTLS VI SEAMS IDMNVLAKINNI ADINL I YPETTLTVTYDQKSH 
TATSMKI ET PATNAAGQTTATVD LKTNQVS VADQKVS LNT I SEGMTPEAATTI VSPMKTY 
S SAPALKS KEVLAQGQAVS QAAANEQVSPAPVKS I TSEVPAAKEEVKPTQTSVSQSTTVS 
PAS VAAE TPAP VAKVAP VRT VAAP R VAS VKWT P KVETGA S P EHVS AP AVP VTTT S TATD 
S KLQATE VKS VP VAQ KAPTAT P VAQ PASTTNAVAAHPENARL QPHVAAYKE KVAS TYGVN 






E FST YRAGD PGDHGKGLAVD F I VGKNQALGNE VAQYSTQNMAANN I S YV I WQQKF YSNTN 
S I YGPANTWNAM PDRGG VTANH YDHVHVS FNK . YKKGSYLASFLYALHRLSRFLYNFY 

SEQ ID NO. 6916 
STRAIN 18RS21 frame: 3 

DLVKQDNKSSYTVKYGDTLSVI SEAMS I DMNVLAKINNIAD I NL I YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVS VADQKVSLNT I S EGMT PEAATT I VS PMKT Y 
SSAPALKSKEVIJ\QEQAVSQAAANEQVSPAPVKSITSEVPAAKEEVKPTO/rSVSQSTTVS 
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSPATD 
SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN 
EF STYRAGD PGDHGKGLAVDF I VGTNQALGNKVAQYSTQNMAANN I S YV I WQQKF YSNTN 
S I YGPANTWNAMPDRGGVTANH YDHVHVS FNK . YKKGS YLAS FLYALNRLSRFLYNFY 

SEQ ID NO. 6917 
STRAIN M732 frame: 3 

DLVKQDNKSSYTVKYGDTXSVI SEAMS IDMNVLAKINNI ADINLI YPETTLTVTYDQKSH 
TATSMKI ETPATNAAGQTTATVDLKTNQVF VADQKVSLNT I SEGMTPEAATT I VS PMKT Y 
S SAPALKS KE VLAQEQAVS Q VAANEQVSPAPVKS I TSEVPAAKEE VKPTQTS VSQLTTVS 
PAS VAAETPAP VAKVAP VRTVAAPRVA S AKWTP KVETGAS P E HVS APAVP VTTT S PATD 
S KLQ ATE VKS VP VAQ KAPTAS P VAQ PA S TTNAVAAHP ENAGL Q PHVAAYKE KVAS T YGVN 
EF STYRAGD PGDHGKGLAVDF I VGKNQALGNE VAQYSTQNMAANN I SYVI WQQKF YSNTN 
S I YGPANTWNAM PD RGG VTANHYDHVHVS FNK . YKKGS YIAS FLYALNRLSRFLYNFY 

SEQ ID NO. 6918 
STRAIN COH1 frame: 3 

DLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLIYPETTLTVTYDQKSH
TATSMKIETPATNAAGQTTATVDLKTNQVFVADQKVSLNTISEGMTPEAATTIVSPMKTY
SSAPALKSKEVLAQEQAVSQVAANEQVSPAPVKSITSEVPAAKEEVKPTQTSVSQLTTVS
PASVAAETPAPVAKVAPVRTVAAPRVASAKVVTPKVETGASPEHVSAPAVPVTTTSPATD
SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN
EFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQKFYSNTN
SIYGPANTWNAMPDRGGVTANHYDHVHVSFNK.YKKGSYLASFLYALNRLSRFLYNFY

SEQ ID NO. 6919 
STRAIN M781 frame: 3 

DLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLIYPETTLTVTYDQKSH
TATSMKIETPATNAAGQTTATVDLKTNQVFVADQKVSLNTISEGMTPEAATTIVSPMKTY
SSAPALKSKEVLAQEQAVSQVAANEQVSPAPVKSITSEVPAAKEEVKPTQTSVSQLTTVS
PASVAAETPAPVAKVAPVRTVAAPRVASAKVVTPKVETGASPEHVSAPAVPVTTTSPATD
SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN
EFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQKFYSNTN
SIYGPANTWNAMPDRGGVTANHYDHVHVSFNK.YKKGSYLASFLYALNRLSRFLYNFY

SEQ ID NO. 6920 
STRAIN CJB110 frame: 3

DLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLIYPETTLTVTYDQKSH
TATSMKIETPATNAAGQTPATVDLKTNQVSVADQKVSLNTISEGMTPEAATTIVSPMKTY
SSAPALKSKEVLAQEQAVSQAAANEQVSTAPVKSITSEVPAAKEEVKPTQTSVSQSTTVS
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSTATD
SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN
EFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQKFYSNTN
SIYGPANTWNAMPDRGGVTANHYDHVHVSFNK.YKKGSYLASFLYALNRLSRFLYNFY

SEQ ID NO. 6921 
STRAIN 1169NT frame: 3

DLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLIYPETTLTVTYDQKSH
TATSMKIETPATNAAGQTTATVDLKTNQVSVADQKVSLNTISEGMTPEAATTIVSPMKTY
SSAPALKSKEVLAQEQAVSQAAANEQVSPAPVKSITSEVPAAKEEVRPTQTSVSQSTTVS
PASVAAETPAPVAKVAPVRTVAAPAPRVASAKVVTPKVETGASPEHVPAPAVPVTTTSTA
TDNKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYG
VNEFSTYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQKFYSN
TNSIYGPANTWNAMPDRGGVTANHYDHVHVSFNK.YKKGSYLASFLYALNRLSRFLYNFY

SEQ ID NO. 6922 
STRAIN JM9130013 frame: 3

DLVKQDNKSSYTVKYGDTLSVISEAMSIDMNVLAKINNIADINLIYPETTLTVTYDQKSH
TATSMKIETPATNAAGQTTATVDLKTNQVSVADQKVSLNTISEGMTPEAATTIVSPMKTY
SSAPALKSKEVLAQEQAVSQAAANEQVSPAPVKSITSEVPAAKEEVKPTQTSVSQSTTVS
PASVAAETPAPVAKVAPVRTVAAPRVASVKVVTPKVETGASPEHVSAPAVPVTTTSPATD
SKLQATEVKSVPVAQKAPTATPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVN
EFSTYRAGDPGDHGKGLAVDFIVGTNQALGNKVAQYSTQNMAANNISYVIWQQKFYSNTN
SIYGPANTWNAMPDRGGVTANHYDHVHVSFNK.YKKGSYLASFLYALNRLSRFLYNFY

PRETTY of: /biotmp/msa237049.2{*} May 14, 2003 03:04

                            1                                                   50
msa237049.2{322_COH1}                                                   dlvkqdnkss
msa237049.2{322_M781}                                                   dlvkqdnkss
msa237049.2{322_M732}                                                   dlvkqdnkss
msa237049.2{322_A909}                                                   dlvkqdnkss
msa237049.2{322_H36B}                                                   dlvkqdnkss
msa237049.2{322_090}
msa237049.2{322_CJB110}                                                 dlvkqdnkss
msa237049.2{322_18RS21}                                                 dlvkqdnkss
msa237049.2{322_2603}       mnkkvlltst maasllsvas vqaqetdttw tartvsevka dlvkqdnkss
msa237049.2{322_JM9130013}                                              dlvkqdnkss
msa237049.2{322_1169NT}                                                 dlvkqdnkss
Consensus                   ********** ********** ********** **********



                            51                                                 100
msa237049.2{322_COH1}       ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_M781}       ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_M732}       ytvkygdtxs viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_A909}       ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_H36B}       ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_090}                                                ETT LTVTYDQKSH
msa237049.2{322_CJB110}     ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_18RS21}     ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_2603}       ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_JM9130013}  ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
msa237049.2{322_1169NT}     ytvkygdtls viseamsidm nvlakinnia dinliypETT LTVTYDQKSH
Consensus                                                           *** **********



                            101                                                150
msa237049.2{322_COH1}       TATSMKIETP ATNAAGQTtA TVDLKTNQVf VADQKVSLNT ISEGMTPEAA
msa237049.2{322_M781}       TATSMKIETP ATNAAGQTtA TVDLKTNQVf VADQKVSLNT ISEGMTPEAA
msa237049.2{322_M732}       TATSMKIETP ATNAAGQTtA TVDLKTNQVf VADQKVSLNT ISEGMTPEAA
msa237049.2{322_A909}       TATSMKIETP ATNAAGQTtA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_H36B}       TATSMKIETP ATNAAGQTtA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_090}        TATSMKIETP ATNAAGQTpA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_CJB110}     TATSMKIETP ATNAAGQTpA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_18RS21}     TATSMKIETP ATNAAGQTtA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_2603}       TATSMKIETP ATNAAGQTtA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_JM9130013}  TATSMKIETP ATNAAGQTtA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
msa237049.2{322_1169NT}     TATSMKIETP ATNAAGQTtA TVDLKTNQVs VADQKVSLNT ISEGMTPEAA
Consensus                   ********** ********_* *********_ ********** **********



                            151                                                200
msa237049.2{322_COH1}       TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ vAANEQVSpA PVKSITSEVP
msa237049.2{322_M781}       TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ vAANEQVSpA PVKSITSEVP
msa237049.2{322_M732}       TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ vAANEQVSpA PVKSITSEVP
msa237049.2{322_A909}       TTIVSPMKTY SSAPALKSKE VLAQgQAVSQ aAANEQVSpA PVKSITSEVP
msa237049.2{322_H36B}       TTIVSPMKTY SSAPALKSKE VLAQgQAVSQ aAANEQVSpA PVKSITSEVP
msa237049.2{322_090}        TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ aAANEQVStA PVKSITSEVP
msa237049.2{322_CJB110}     TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ aAANEQVStA PVKSITSEVP
msa237049.2{322_18RS21}     TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ aAANEQVSpA PVKSITSEVP
msa237049.2{322_2603}       TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ aAANEQVSpA PVKSITSEVP
msa237049.2{322_JM9130013}  TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ aAANEQVSpA PVKSITSEVP
msa237049.2{322_1169NT}     TTIVSPMKTY SSAPALKSKE VLAQeQAVSQ aAANEQVSpA PVKSITSEVP
Consensus                   ********** ********** ****_***** _*******_* **********



                            201                                                250
msa237049.2{322_COH1}       AAKEEVkPTQ TSVSQlTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_M781}       AAKEEVkPTQ TSVSQlTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_M732}       AAKEEVkPTQ TSVSQlTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_A909}       AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_H36B}       AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_090}        AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_CJB110}     AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_18RS21}     AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_2603}       AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_JM9130013}  AAKEEVkPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VA..APRVAS
msa237049.2{322_1169NT}     AAKEEVrPTQ TSVSQsTTVS PASVAAETPA PVAKVAPVRT VAapAPRVAS
Consensus                   ******_*** *****_**** ********** ********** **__******



                            251                                                300
msa237049.2{322_COH1}       aKVVTPKVET GASPEHVsAP AVPVTTTSpA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_M781}       aKVVTPKVET GASPEHVsAP AVPVTTTSpA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_M732}       aKVVTPKVET GASPEHVsAP AVPVTTTSpA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_A909}       vKVVTPKVET GASPEHVsAP AVPVTTTStA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_H36B}       vKVVTPKVET GASPEHVsAP AVPVTTTStA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_090}        vKVVTPKVET GASPEHVsAP AVPVTTTStA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_CJB110}     vKVVTPKVET GASPEHVsAP AVPVTTTStA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_18RS21}     vKVVTPKVET GASPEHVsAP AVPVTTTSpA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_2603}       vKVVTPKVET GASPEHVsAP AVPVTTTSpA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_JM9130013}  vKVVTPKVET GASPEHVsAP AVPVTTTSpA TDsKLQATEV KSVPVAQKAP
msa237049.2{322_1169NT}     aKVVTPKVET GASPEHVpAP AVPVTTTStA TDnKLQATEV KSVPVAQKAP
Consensus                   _********* *******_** ********_* **_******* **********



                            301                                                350
msa237049.2{322_COH1}       TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_M781}       TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_M732}       TAsPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_A909}       TAtPVAQPAS TTNAVAAHPE NArLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_H36B}       TAtPVAQPAS TTNAVAAHPE NArLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_090}        TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_CJB110}     TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_18RS21}     TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_2603}       TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_JM9130013}  TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
msa237049.2{322_1169NT}     TAtPVAQPAS TTNAVAAHPE NAgLQPHVAA YKEKVASTYG VNEFSTYRAG
Consensus                   **_******* ********** **_******* ********** **********



                            351                                                400
msa237049.2{322_COH1}       DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_M781}       DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_M732}       DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_A909}       DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_H36B}       DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_090}        DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_CJB110}     DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_18RS21}     DPGDHGKGLA VDFIVGtNQA LGNkVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_2603}       DPGDHGKGLA VDFIVGtNQA LGNkVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_JM9130013}  DPGDHGKGLA VDFIVGtNQA LGNkVAQYST QNMAANNISY VIWQQKFYSN
msa237049.2{322_1169NT}     DPGDHGKGLA VDFIVGkNQA LGNeVAQYST QNMAANNISY VIWQQKFYSN
Consensus                   ********** ******_*** ***_****** ********** **********



                            401                                                450
msa237049.2{322_COH1}       TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_M781}       TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_M732}       TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_A909}       TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALh
msa237049.2{322_H36B}       TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALh
msa237049.2{322_090}        TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_CJB110}     TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_18RS21}     TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_2603}       TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_JM9130013}  TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
msa237049.2{322_1169NT}     TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK.YKKGS YLASFLYALn
Consensus                   ********** ********** ********** ********** *********_



                            451    460
msa237049.2{322_COH1}       RLSRFLYNFY
msa237049.2{322_M781}       RLSRFLYNFY
msa237049.2{322_M732}       RLSRFLYNFY
msa237049.2{322_A909}       RLSRFLYNFY
msa237049.2{322_H36B}       RLSRFLYNFY
msa237049.2{322_090}        RLSRFLYNFY
msa237049.2{322_CJB110}     RLSRFLYNFY
msa237049.2{322_18RS21}     RLSRFLYNFY
msa237049.2{322_2603}       RLSRFLYNFY
msa237049.2{322_JM9130013}  RLSRFLYNFY
msa237049.2{322_1169NT}     RLSRFLYNFY
Consensus                   **********

Table 70: Comparative Sequences relating to SAG1280



SEQ ID NO. 7001
STRAIN 2603 

ATGGGAGGGAAAATG AAT CAAGAAGT CTTACTACAAATG ATGAG AG C CACT ATT C CT C 
GTGATAGAGCCTTGCTTGAGGCATTTTTATATTACCAAGC^ 

GGGATAGTCTTATTCATCAGTTTATGACCAATAGGCAAGAAATAAATAAGTCTGTTCAAG 

TACTTCACrTTGAGACAGATGTTT CAG CTTTTGTCCAGGCTAGTCCTTATGATACTGCTC 

ATGAT CTATTGAC CTAT ACACAAGTTTT CGG C CAAAGTGGT CTTCAAAAACTAG ATAAAC 

TATCGCCGTCTGAAAAAAACTTGGTGATAGAAGTGGCCTTGTTCAATCTGGCCACTCGTT 

TTCMTTATTGGATTCCAATGGACACTAGCAAACCATATCGCCGGATTCACTCTTACAAA 

AGAGTAGGG G AG CTAATTT GGT CAATGTGTAT CGTGTGG CTAATAATTT AGC GGAT CGTA 

TTAGT CG AG ATATTGAACAG'TTT CTCTTAACTTACGAGCCTG AG CTTGAAACTAGAG CTG 

ATGAAACTGTTCTAGAAAATGAAGAAACTGTTGATGAGCAC^AAACAAGTGTTCATCAAG 

CAATATCITTI'CGAGAAGAGGGCTCTCTGGTTATTGCTAGTTTGGATGTAGATTTGTCTC^ 

AACTAGATGTTCAAATAGG AAAAACCAGTCAT CT G C CAG CTT ATGAAGAGTTAT C CTTAC 

GACGT AAATTTGAG ATT CTAACAT ATTTTG AC CAAATTCGAAATGAACGTT C CAAAGT C C 

CAAGTTTTAGACGAGGTGATTTTGACACAGAGATGGAAATGACACCAGTCTTTGATGGCG 

AGGAATTACTTACTTATCTCGAAGCTGATGGCAGTCCCTATGAGCTGAAACGAACGCTGA 

CTACAGT CG AAG AAAAGGAATT AGAAAAAATTGGACAAG C CATTAGGAT AGAAAATCAAG 

AAAAATTGACT CAG CTAGGGATTGATTTAT CT CAGTTTG AC C CAG AC CG AGT CGGTATTT 

TATTGGATGCAGCAGGTCGTTTTCGTTTAAAAAATGCAGACCTTGCTTT 

ATCC CAAAG C CT CGGTAACT CAACTAG C C CTTGCGACAGAACTACT C CAAATGGGACTAA 

GTCATGAAAAGGTTGAATTTTT CTTTGGT AG C CAGU"1"1"1'C CATTG AAGAG CTG CG ACAAG 

TTGCCTACX3CCTTTTTATACCAAGAACTCAGCAGAGAAGATGCGGAGCAATTTG 

ATAAAGGTAATCAG C CAGATTTAACTCT CAGAGATTGGAAAAGCAAG CT AGAGAAAG CTG 

AGGGAAAAGAAGTAGTTGATGAAGAATTCG CX3GAAAATC CACTGGTT CAGAGAGTATTGG 

AGACT^ATCCTCTGGGGTCATTGGTTTCCrrATAAG 

TCAGCGATGCTCGATTGAACGGTTTGATTCGGAT^ 

TCATTGAACAAAAT CCAGTT CrTTATGTGAGGACCTGGGAAGAAGTCAGTCAGG CACTTC 
ATCAG CCAAAGG C^GAACCACAAACAGAGTTAGAAGAAG CGGACCAAGAATTAAACCTAT 
TCTCATTTCTGGAAGAGGAGCCAGTTCAGAGTATTGGACT CAG 
AAAATGGT CATAACGATACTGATCTTGAAGAAACAGATAAT CAAATT CCTGAAGAGGAAG 
TCGT CGAAACAATTC CAGAGATTC CAGTAACGGACI7IT?TATT TT CCAGAAGATTTGACGG 
ACTTTTATCCTAAGACTGCTAGAGATAAGGTTGAGACAAACATTG 
TAAAAAATCTAGAAGTAGAGCACCGCAATGCTrTCACCAAGTGAACAAGAACT 
AGTATGTAGG CTGGG GTGGACTAG CCAATGAATTTTTTGATGACTATAAT CCAAAATTTT 
CTAAGGAACGAGAAGAACTGAAGAGCCTAGTCACAGATAAAGAGTATTCGGATATGAAAC 
AGTC(2TCCCTGACAGCCTATTACACAGACCCATCCCTGATCCGTCA.GATGTGGGATAAGT 
TGGAAAGAGATGGCTTTAC^GGTGGCAAAATCCTAGATCCTTCCATGGGA^ 
TCTTTGCGGCTATGCCAAAACACITAAGAGAAAAGAGTGAGTTGTATGGC 
ATACTATTACAGGAGCTATTG CCAAACAC CTTCAT CC CAATAGTCAT ATTGA^ 
GATTTGAGACGGTGGCTTTTAACGACAATAGTTTTGATTTG^ 

TTGCCAATATACGAATTGCGGATAATAGGTACGATAG^CCTTACATGATTC^TGACTACT 
TTGT CAAAAAGT CACTTGATTT GCTTCATGATGGTGGACAAGTAG CGATT AT CT CTTC CA 
CAGGAACTATGGATAAGCGAACAGAAAACATCTTACAAGATATTCGTGAGACAACTGAAT 
TTCTTGGTGGGGTTCGACrGCCTGACTCTGCCTTTAAGGCCATT 
CAACGGATATGTTATTCTTCCAGAAACACTTAGACAAGGGATATGTGGCA 
CCTTTTCAGGTTCCATTCG CTATGACAAGGATAGTCG CAT CCTTATTTTG 
ATGGAGAATACAATAGCCAGGTGCTAGGAACCITACGAGGTCAGGAATT^ 
CACTT^CTGTTAAGGGGACTAGTGATGACTTGATTGCAAGTGTTC^ 

ACGTTAAGG CCC CAAGAGAG ATTGATAGAAATGAGGT CAT CATTAAC CCAGATGTGTTG A 

CCAAACAAGTCAATGATAC CTCCATT C CAG CTGAAATGAGGGAAAAT CTAGGTCAGTACA 

GTTTTGGTTATCAGGGGT CT ACAGTTTACTATCGAG^TAACLAAAGG CATT CGAGT CGGAA 

CCAAG ACGGAAGAAATCAGTTACTATGTCG ATGAAGAGGG CAACTTCAAAG CATGGGACA 

CCAAACATTCTCAAAAG CAG ATTGAT CGCTTTAATGC CTTAGAAGTC 

CT CTGGATGTCTATGTG AC CGATGATG CA.G CGAAACGTGGTCAGTTTAAGGGGTATT ATA 

AAAAGACAGTTTTCTATGAAGCTCCATTGTCI^ 

TC4GTCGATATTCGCAATGCCTACCAAGAAGTTATTGCCATTCAACGCTATTATGACTATG 

ATAAGGAGACCTTTAACCACTTGTTAGGCAAACTCAATCGT^ 

AAGACTATGGGTATTTGAATAGTGCrGTGAACCGCAATCT^ 

ATTC G CTTCTTG CT AGTTTG GAAG ATGAAAGT CTGGATC CAAGTGGAAAGT CTGTTATCT 
ATACTAAAT CC CTTG CCTTTGAGAAGGCTT CTAGTG CGTCOT 
TGCATACTGCCCTTGATGCCTTAAATTCGAGCnTGGCT 

CTTATATGATGT CTAT CTAT CAGGTTGAATCGCAGATGAC CTTG ATTGAGGAGTTAGGCG 
AC CT CATTATG CCTG AT CCTGAGAAGTATTTGAATGG AGAATTG ACCTAT GTTT CT CG CC 
AAGACrTTTCTTT CAGGGGATGT CGTCACTAAGTTAGAAGTGGTAGAT CTATTCGT CAAAC 
AAGACAATCAGGACTTTAACTGGT CACATTAT G CG GGACTTCTAGAAGCTAT CAAAC CAG 
CC CGTATTACTTTGGCAGACATTGATTAT CGAATCGGTTCACG CTGGATTCCTCTGG CTG 
TTTATCX3AAAATTTG CC C^GAAACCITTATGGGGAAAG C CTAT G AACTGTCAGACCAAG 
AAGTAGCGACA.GTCCTAGAAGTCAGTCCCATTGACGGC4GTTATC^ CAAT CTAAGT 

TTGC CTACACCTATT C CAACG CAACGG ATAGGAGTTTAGGTGTC C CTG CTTCACG CTATG 
ATAGTGGTCGAAAAATCTTTGAAAATCTCCTGAATTCCM^ 

AAGTTGTCGAAGGGGATAAGAAAAAGAATGTGACGGATGTAGAGAAAACAACGGTCCTGC 
GTGCCAAGG AAACACACCT ACAAGAACT CTTT CAAGGTTTTGTAG CAAAGTAT CCAGAAG 
TCCAACAAATGATTGAAGACACCTATAATAGGCTCTACA^ 

ATGATGGTAGT CATTTAACCATTG ATGGACir"G CT CAGA&TATCT CCTTACGTCCT GACC 
AAAAGAATG C CATT CAACGAATTGT CGAGGAAAAACX3TG CTCTTAOT 

GTT CAGGTAAAACACHTAC CATGCTTGGGG CAGGATT OVAACTGAAAGAACTCGGAATGG 
TACATAAACCACTTTATGTGGTGCCGTCTAGTCT'GACTGCT 

TGAAATTTTT CCCT ACCAAG AAAGT CTATGTGACTACTAAGAAAGACTTTGC CAAAG C CA 
AACG CAAG CAGTTTGTGTC C CGTATTATTACAGGGGACTATGATG CCATTGT CATTGGGG 
ATTCACAATTTGAGAAGATACCGATGAGTCGTGAAAAACAGGTC^ 

AACTTGAG CAACTCCGAGAAAT CAAG CTAGGAAGTGACAGTGATTACACGGTGAAAG AAG 
CGGAACGTTCGATTAAGGGATTAGAACACCAGTTGGAAGAACTCC^ 